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[0001] This application claims the benefit of co-pending provisional patent application 
serial no. 60/247,271, filed November 10, 2000. 
Background of the Invention 

[0002] This invention relates to improvements in the process of developing new products 
and services, and the attendant activities of consumer research, market segmentation, 
design iteration and market testing, as well as the marketing of such products and 
services, through direct customer participation. The invention also relates to the process 
of collective decision making, which presents issues in many respect parallel to those 
encountered in the design and product development process. 

[0003] Early in human history, the distinctions between the designer, the manufacturer, 
and the user of an artifact simply did not exist. People made their tools for their own use, 
and built their dwellings in an unselfconscious process passed across generations. Later, 
as the various arts and crafts evolved, the artisan or craftsman embodying both design and 
manufacturing functions remained close to his customers. The small volumes involved 
and the largely custom nature of craft production meant that the product responded 
directly to the needs and wants of individual customers. The industrial revolution 
brought an increase in the division and specialization of labor, along with the attendant 
economies of scale and scope. As a result, the design and production functions became 
distinct, production volumes increased, and products became more standardized. A 
particular product usually now had to satisfy a larger group of customers. That trend took 
a major leap with Ford's development of mass production. 



1 



[0004] Many products today require for their design large groups of people with highly 
specialized skills and knowledge, often numbering in the thousands, and often spread 
across continents. Furthermore, the development lead-time for some products can easily 
stretch to many years (e.g., new generation aircraft). The complexity of these products 
5 and services, and of the processes used to develop them, is reflected in the organizational 
structure of the companies which design and make them. Within the typical product 
development organization, the stakeholders in a given product development project 
include such diverse departments as product planning, styling, engineering, sales and 
marketing, manufacturing, after-sales service, legal affairs, and more recently, members 

10 of outside part supply companies. Each of these departments or organizations has its own 
objectives, constraints, and performance measures, and its executives and managers their 
own goals and idiosyncrasies. These and other factors have conspired to increase 
dramatically the distance between the people who design products and services, and the 
customers who consume them, whether the distance is measured in terms of geography, 

15 time, and technical knowledge, or in terms of worldview, goals, and daily concerns. 
[0005] In the past, many product development organizations relied on a few powerful 
individuals in their design or marketing departments, or in their executive ranks. These 
individuals in turn relied on their knowledge of the market and the customer, on their 
understanding of the technological possibilities, and on their vision, judgment, 

20 experience, preferences, prejudices and biases. In recent years, as consumers have grown 
increasingly sophisticated and knowledgeable, and as markets have become increasingly 
fragmented, this job has become more difficult. 

[0006] More recently, companies have adopted flatter, less hierarchical organizational 
models, with decision making responsibility pushed lower through the ranks, and they 
25 have embraced a new focus on the "voice of the customer." This movement was intended 
to remind them that they are mere proxies for the ultimate consumer of the goods or 
services being designed, and that the needs and desires of the customer should be the 
paramount input to that process. 
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[0007] The process of going from the voice of the customer to a product or service that 
reflects it remains fraught with errors and the potential for distortions. The first source of 
error is in ascertaining the wants and needs of the customer; the second is in the process 
of translating that input into a decision, product, artifact or service without coloring and 
5 distorting it. Practitioners have developed and used several tools and techniques intended 
to assess the needs of the customer and to translate these needs into a product concept and 
into engineering requirements. These higher-level tools include Concept Engineering, and 
the House of Quality and Quality Function Deployment. A critical aspect of these higher 
level tools and methodologies is that they not only bring the product development team or 
1 0 organization closer to the customer, but also play an important role in creating a 
consensus between the different functions in the product development team, and in 
bringing the different parts of the organization together to work toward a common goal. 
PI In other words, whenever conflicts arise between different parts of the organization, they 

are supposed to be resolved by going back to the voice of the customer. If honest 
W 1 5 differences in interpreting the voice arise, the solution would be to seek clarification from 

IP the customer. These tools represent significant improvements in the product design and 

w development process, but remain cumbersome and difficult to use, as their protagonists 

H : point out, and they require significant commitments of time and effort on the part of their 

O users. 

u pi 

Jf 20 [0008] The tools and instruments that traditionally have been deployed by market 
M : researchers range from the highly qualitative methods borrowed from ethnography, such 

as open-ended interviewing, participant observation, and focus groups, to the highly 
popular quantitative statistical methods such as survey research and conjoint analysis. 
Some of these tools and techniques suffer from several shortcomings, which are detailed 
25 below. 

[0009] During the development of a new product or service, the design organization 
typically will undertake a number of market research studies. Early on during the project, 
these may be more qualitative in nature, intended to uncover latent needs, or to develop 
new ideas for products and services. Later, the research may be more focused, intended 



to obtain feedback from current or potential customers on certain features or attributes of 
the proposed product; these could rely on qualitative methods, a focus group for example, 
as well as quantitative ones, such as surveys or structured serial interviews. One problem 
with consumer clinics is that potential customers are typically shown, and asked to 
5 comment on, a limited number of alternatives. This is done in order to keep the cognitive 
demands on the participants at a reasonable level, as these clinics are generally limited to 
a period of less than two hours, including the time necessary for providing the 
participants with the background and contextual information necessary for properly 
assessing the designs presented to them. Another reason is that the designs shown to the 
10 participants are in the form of models or prototypes that are costly to produce. In the 

traditional consumer clinic, people are suddenly taken from the world of today and asked 
to comment on future designs that they had not previously seen and which have not had 
J* the time to sink in. 

O [0010] Furthermore, consumer clinics, in which new products are shown to participants 

e set 
s s • 

ifj I 15 who are asked to comment on them, assume that people have preexistent preferences that 
Jjj are well-developed and stable. They therefore assume that the attitude that the 

iy participants form upon seeing the new product are valid and reflect the attitudes they will 

y. have when (and if) the product goes on the market. Yet, it is well-known that in many 

IZ, cases, people's long-term disposition towards a product differs from their initial reaction. 

%y 20 For example, it is not uncommon for a consumer to feel initially that the styling of an 
It outgoing automobile model is more attractive than that of its newly introduced 

replacement, only to change his or her mind after a few weeks of seeing the newer one on 
the road. Conversely, it is not uncommon for people's assessment of the attractiveness of 
a new product to plummet after the novelty wears off. This phenomenon is probably a 
25 reflection of two countervailing human tendencies, the desire for novelty and variety 
seeking on the one hand, and the comfort of the familiar on the other. Due to their 
compressed format, Focus groups and consumer clinics are vulnerable to this 
phenomenon. 
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[0011] Another problem with clinics and focus groups has to do with the interpersonal 
dynamics that the situation entails. In general, group dynamics are desirable in the sense 
that the discussion that takes place between participants is the mechanism for generating 
data, and the desired output is the active sharing and comparison of the participants' 
5 experiences and opinions. Problems arise when one or a few strong individuals end up 
dominating and biasing the discussion. Another difficulty is finding participants who do 
not know one another. This is desirable in order to avoid having one participant choose a 
particular design simply because his or her friend also chose it. This situation arises often 
when the product or service being designed is targeted at a small group of users, or users 
10 who are all members of the same group, for example, designing a benefits package for the 
employees of one company. Similar problems arise when the potential customers for a 
product happen to be competitors, and therefore less willing to sit together and share their 
preferences. 

[0012] The interpersonal dynamics in the traditional focus group or consumer clinic are 
1 5 magnified when the designs being presented are radically novel or unusual. In such 
cases, many participants find it difficult to express their true opinions in front of the 
group. They find it safer to retreat to the safety of negative criticism. They tend to focus 
on what they find wrong with the design, instead of looking at the whole design and its 
potential benefits. Furthermore, it is well known that in the case of many products and 
20 services, consumer preferences vary geographically or ethnically. Southern California is 
considered to lead the rest of the country in automotive trends. Color preferences in the 
USA are different than those in France or China. For that reason, companies will 
generally hold consumer clinics in several different markets, each of which would be 
considered representative of a particular geographical area. This adds to the cost of using 
25 that format for eliciting consumer preferences. 

[0013] Conjoint Analysis is used to assess consumer preference for different choices of 
products and services. It is a multi-attribute utility or preference measurement technique 
that explicitly accounts for the subjective tradeoffs people make when deciding among 
alternatives with multiple features and benefits. In its basic form, Conjoint Analysis is a 
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decompositional technique: the parameters that measure the importance the decision 
maker ascribes to the different aspects of the product are derived, using statistical 
regression techniques, from the decision maker's evaluations of a number of full profile 
descriptions of the product or service. Conjoint Analysis has been used in a wide range 
5 of applications, from developing soaps and dietary supplements to improving the appeal 
of military careers within the Department of Defense. 

[0014] The first step in conducting a Conjoint exercise is to identify the relevant 
attributes of the product or service in question, and to identify the levels of interest for 
each attribute. This is typically based on previous experience with similar products, and 
10 on earlier qualitative research such as an open-ended interview or a focus group. As an 
example, in the case of an automobile study, engine displacement may be one attribute of 
interest, with 2.0, 2.5, and 3.0 liters the three levels to be tested; and body style may be 
q another attribute, with "sedan" and "coupe" as the levels of interest. Next, a number of 

[jj full-profile descriptions of potential products, that is, descriptions in which every attribute 

1 5 is represented by a value, usually using a highly fractionated factorial orthogonal design 
| n (i.e., only a small fraction of all possible product profiles are used in the test.) These 

w profiles are shown to the respondent, traditionally in the form of prop cards, and the 

H respondent is asked to rank them by order of preference or to rate each of them on an 

n interval scale, for example, from 0-100. The responses then are analyzed using statistical 

H the attribute levels, that is, the contribution of each attribute level to the overall 

preference level of a profile. Returning to the earlier example, it might turn out that for 
one particular respondent, a 2.0 liter engine has a part-worth of 0.0, the 2.5 liter a part- 
worth of 0.5, and so on; the "sedan" body style may have a part worth of 0.0, whereas the 
25 "coupe" style may have a value of 0.8. Once the part-worths for an individual are 

obtained in this way, it is then possible to search through all the possible combinations of 
attribute levels to synthesize the optimal product for that individual, that is, the product 
that would give him or her the highest possible level of utility, or that he or she would 
have the strongest intention of buying. 
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[0015] Conjoint Analysis studies typically are conducted with more than one individual, 
and part-worths are typically obtained for a representative sample of consumers. This 
multi-respondent data can be used for several purposes. One is to identify the product 
design that would result in the greatest market share for the product development 
5 organization, given the attributes of competing products on the market (current and 

expected; this is known as the "share-of-choices" problem. Another purpose is to identify 
the product design that would maximize overall consumer utility, that is, the sum of 
utilities across all the consumers; this is known as the "buyer's welfare" problem. 
Solving these search problems is a hard computationally; mathematically, these are 
10 known as NP-Hard problems, requiring heuristic dynamic programming procedures for 
their solution. More recently, the adaptive search techniques of Genetic and Evolutionary 
Computation, more specifically Genetic Algorithms (GAs), have been used more 
}1 effectively to find solutions to these problems. In that case, Conjoint Analysis data 

: 5S=? 

0 collected previously, using standard Conjoint Analysis techniques; in a separate and 

in 

iy 15 subsequent step, that data was subjected to the aforementioned search technique to find 

?2 the optimal solutions or designs. 

W [0016] Another purpose of collecting Conjoint data from a representative group of 

Mf participants is to identify distinct market segments with different preference profiles, 

Q This is done through cluster analysis, a statistical technique for finding subgroups of 

^ 20 respondents such that respondents within a subgroup value the different product attributes 

U similarly, but differently from respondents in other subgroups. Once clusters are 

identified, those that present significant commercial potential can be targeted with 

specific product designs. 

[0017] Conjoint Analysis offers two major advantages over other techniques. One 
25 advantage stems from its decompositional nature. 

[0018] Conjoint Analysis has shortcomings. The first is the tediousness of participating 
in the process as a respondent. Generally, the product designers and marketers, by virtue 
of their intimate involvement with and knowledge of the product, want to answer a large 
number of issues and test a large number of attributes. The customers on the other hand 
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are generally less engaged and reluctant to submit to lengthy questionnaires. And even 
though highly fractionated factorial designs are used (a research design that itself 
introduces serious shortcomings, as discussed later), respondents typically still asked to 
rate a considerable number of possibilities. For example, in a case where there are 12 
product attributes, with four different levels for each attribute, the respondent would face 
about 35 profiles. That number is often multiplied by a factor of 3 in order to reduce the 
effect of random errors, resulting in the respondent having to face over 100 questions. 
The laboriousness of the process often leads to confusion and loss of attention and focus 
on the part of the respondents, who often end up resorting to heuristics as a shortcut for 
getting through the questionnaire, (several example Conjoint exercises can be found on 
the World Wide Web; see, for example, www.coniointonline.com .) For example, instead 
of properly weighing all the attributes against one another, they only rely on one or two to 
make their decision, leading to inaccurate results. 

[0019] More recently, several modifications to Conjoint Analysis that aim to reduce the 
tediousness of the process, and the resulting inaccuracy of the results, have been proposed 
and used in practice. These hybrid techniques do not consist exclusively of full profiles 
of hypothetical products, as in conventional Conjoint Analysis, but they start off by 
asking the respondent a set of self-explication questions (non-conjoint questions that 
involve no trade-offs), and follow that with partial-profile descriptions. Examples of such 
techniques include Adaptive Conjoint Analysis and the newer Hierarchical Conjoint 
Analysis. 

[0020] In Adaptive Conjoint Analysis as implemented by Sawtooth Software (the most 
frequently used technique for commercial conjoint studies in both the United States and 
Europe, the survey starts by asking the respondent to eliminate those attribute levels that 
he or she would find unacceptable under any conditions. Those levels are no longer used 
in the subsequent part of the interview. Next, the respondent is asked to reduce the levels 
in each attribute to the 5 levels he or she is most likely to be interested in. The next step 
in the process asks the respondent to rate the importance of individual attributes; these 
ratings attempt to eliminate those attributes deemed unimportant, and to generate initial 
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estimates of the respondent's utilities, which subsequently are used to generate a set of 
customized paired-comparison questions using partial profiles. With each response, the 
estimates of the respondent's utilities are updated, and appropriate paired-comparison 
questions generated. These questions are designed to converge and focus on the subspace 
5 of attribute comparisons that appears most favored by the respondent based on the earlier 
responses, with the objective of refining the estimates of that respondent's trade-off 
profile within that limited subspace. 

[0021] Clearly, Adaptive Conjoint Analysis relies heavily on the self-explicated 
evaluation component of the questionnaire, where the decision maker is asked explicitly 
10 to indicate his attitude towards various attributes separately. A key assumption behind 
that method is that the respondent's attitudes and preferences are pre-existent and stable. 
Adaptive Conjoint relies on that assumption to quickly narrow the choices presented to 
the interviewee and reduce the workload imposed on him or her. Adaptive Conjoint thus 
precludes the possibility that the respondent might uncover or evolve new personal 
15 preferences or attribute trade-off profiles as he or she participates in the study. The 
problem with that approach is the danger of reification of any preconceived notions or 
^ partial, ill-formed preferences the respondent might have a priori, resulting in a sub 
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M optimal to the product design problem. In fact, users of Adaptive Conjoint Analysis are 

M 

□ warned against allowing respondents to eliminate attribute levels (the first step described 

]S 20 in the previous paragraph) "unless there is no other way to make an interview acceptably 

brief." 

[0022] A more recent development, Hierarchical Bayes Conjoint Analysis, improves on 
Adaptive Conjoint through the use of more robust and theoretically more defensible 
statistical methods. It does not however address the problem described above. 
25 Furthermore, Hierarchical Bayes Adaptive Conjoint Analysis relies on the responses of 
other participants in the study to improve the estimates of each individual's utilities; in 
other words, Hierarchical Bayes makes it possible to trade the number of the respondents 
surveyed with the workload on any individual respondent. It is highly computationally 
intensive procedure however, requiring several hours of running time on a typical 
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personal computer; it is therefore not very useful in a real-time online context. The 
existing software products perform the Hierarchical Bayes analysis of the data obtained 
through an Adaptive Conjoint study after the fact, offline. 

[0023] The second major shortcoming of Conjoint Analysis, one that is not addressed by 
any of the improved methodologies discussed above, stems from the assumption that the 
different product attributes are independent of one another. Conjoint Analysis is a "main 
effects only" model; it assumes there are no interactions among attributes. In the additive 
part-worths model that is used universally, an individual's preference for a particular 
product is assumed to consist of the sum of independent functions of the attribute levels 
in that product. Using an automotive example again, a consumer's preference for exterior 
color, bright red versus dark gray for example, is assumed not to depend on body style, 
whether the automobile in question is a sport coupe or a luxury sedan. Yet we know 
empirically that bright red is a more popular on sporty cars than it is on luxury sedans. If 
the researcher suspects that there may be some interaction between two attributes (based 
on product knowledge or from statistical analysis), the solution within the Conjoint 
Analysis framework is to define composite variables ("superattributes") that are a 
combination of the two interacting attributes. These super-attributes are given the levels 
formed by combining the individual attribute levels. Returning to the previous example, 
the composite attribute would be "color-body style", and it would take on four levels (two 
times two): "bright red sports coupe", "bright red luxury sedan", "dark grey sport coupe", 
and "dark grey luxury sedan". The problem with that work-around is that it is highly 
deleterious to the respondent workload. (It is after all the main-effects only aspect of 
conjoint that makes possible the highly fractionated factorial designs.) Instead of two 
attributes with two levels each, we now have three attributes with a total of eight levels. 
This combinatorial explosion is much more severe when a more realistic number of 
individual attribute levels is used: in the case of five colors and five body styles, we 
would go from 10 levels (5+5) to a total of 35 levels (5+5 + (5x5).) The number of 
parameters to be estimated by the Conjoint study, and therefore the number of questions 
respondents are subjected to, increase in proportion to the number of these levels. 
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[0024] The "main-effects only" nature of Conjoint Analysis has a more subtle and 
insidious effect, as it affects how many marketers and product developers come to think 
about their products and services. By relying on Conjoint Analysis to obtain the voice of 
the customer, they tend to design studies that use those attributes of the product which are 
more readily decomposable; and they present them in a way that makes it easy for the 
respondents to separate them. Respondents end up focusing on a few of these attributes, 
and using them heuristically (as mentioned earlier), and not performing the additional 
mental processing that would reveal possible interaction between attributes. The result is 
an artificially good fit to the additive partworths model, but poor predictive accuracy. 
[0025] More fundamentally, the very notion that a product or service can be adequately 
described to a consumer by .a set of attribute levels is itself problematic. Since it works 
by presenting decomposable stimuli to the respondent, Conjoint Analysis is particularly 
ill-suited for understanding how consumers evaluate important classes of products, 
namely, products that are perceived holistically by the consumer. Examples of such 
"unitary" products include, but are not limited to aesthetic objects, foods, fragrances, and 
music. Even though a perfume expert (known as a "nose" in the trade), upon smelling a 
scent, may be able to analyze it and describe its major attributes, that faculty is not 
available to the majority of perfume buyers. In such cases, where the respondent cannot 
break the stimulus presented to him or her into component parts or attributes, attempting 
to build simple models of the respondent's preference based on factorially designed 
studies is unlikely to succeed. 

[0026] By contrast, this invention does not require that the same factors used by the 
marketer or designer to alter the product be presented to the respondent to assess his or 
her preference. In the present invention, the respondent is presented by a stimulus that 
matches the way in which he or she perceives the particular product or service in real-life. 
Summary of the Invention 

[0027] In a generic sense, the invention provides methods of determining which of a large 
number of forms of a product, each of which has a plurality of alternative attributes or 
attribute values, is preferred by a "selector." A "selector," as used herein, is one or a 
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group of persons whose input is being considered during the course of the practice of a 
method of the invention. "Selector" may refer either to a collection of entities 
participating in an exercise, or a single person, or the individual entities participating in 
an exercise. A selector may be a focus group, a working group of designers and/or 
managers within a company or professional design service organization, a group of 
people representative of a target demographic group, members of a club or class 
dedicated to some activity or pursuit, enthusiasts who are potential customers for a given 
product such as dog owners, golfers, interior decorators, cyclists, homeowners, teen-aged 
boys, persons who are employed by a company or who work within an industry, etc. 
Persons acting as selectors have presented to them once or serially groups of, for 
example, two to a dozen or so different possible design forms. In the aspect of the 
invention referred to herein as the virtual salesperson, the selector is an individual, a 
purchase agent, or a small group such as a couple or a family. 
[0028] The selector also may comprise a group of persons engaged in a cooperative 
design of a product, such as a group of young women designing next spring's fashions, a 
professional industrial design group designing an automobile seat, a small group of 
architects designing a home for a client, or a group of musicians composing a piece of 
music. In this case, once a consensus for a design is reached the method may include the 
additional step of producing a plurality of units of a selected product form or a product 
resembling that form. When the selector is a group of persons, the derived group of 
product forms presented to a person in the group may be generated using data indicative 
of the preferences expressed by one or more other persons in the group. Also, the 
invention contemplates repeating the presentation of specific product forms within a 
particular derived group to one or more persons serving as the selector. 
[0029] "Preference," which may also be referred to as "affinity," as used herein indicates 
a selector's favor (or disfavor) for a particular item having a set of attributes. In one 
embodiment a positive affinity value indicates that the selector favors a particular item 
while a negative value indicates that the selector disfavors that item. 
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[0030] In the methods of the invention, the proposed designs are presented to the 
participants, and feedback from the latter is collected via, for example, individual 
personal computers connected in a network such as an intranet, an extranet, or the 
internet. It is accordingly possible to control the interpersonal dynamics among the 
participants. It is also possible to isolate them completely from one another, so that no 
one of them is aware of the preferences expressed by the others. It is also possible to 
allow selective levels of information to be shared among the participants, to initiate a real 
or virtual group discussion, to control the degree of social pressure they may feel, to 
satisfy a craving for information about the status or direction of the project, or for 
information about what products others have purchased. This could be used to mimic the 
network externalities that take place in real life, where some people tend to favor the 
same products that their peers are buying and consuming, while others may choose to take 
a contrarian attitude. This is important in such products as fashion apparel or accessories, 
investment instruments or portfolios, computer software, and so on. Furthermore, by 
connecting participants via a computer network, it is possible to assemble a group of 
participants that are located in very different geographical locales. The methods also 
facilitate time management, as they reduce the need to bring together all participants at 
the same time by seamlessly integrating data that is received at different points in time (in 
certain embodiments of the invention.). 

[0031] "Products", as used herein and explained more fully below, is intended to be a 
generic term referring to goods, such as objects intended to be mass produced, 
modularized goods such as personal computers which comprise a plurality of 
interchangeable parts suitable for mass customization, services, such as mutual funds or 
travel services, and plans, such as a written list of alternatives for governing future 
conduct of an individual or organization, such as a business plan or a menu of food items 
to be consumed by a group. 

[0032] "Attributes" of a product, as used herein, is intended to refer to the structural, 
functional, stylistic, or economic features of the product, service or plan and include 
things such as cost, color or color combination, size, strength, shape, style, pattern, 
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length, weight, content feature, theme, option, choice of material, softness, etc. The 
product attributes may be aesthetic or functional. A given product has a series of possible 
attributes that are combined using the method of the invention to develop a design. 
Different types of objects of the design or selection obviously will have different groups 
5 of possible attributes. Thus, for example, designs for an aesthetically pleasing exterior 
appearance of a hands-free telephone would have "attributes" such as material (e.g., 
plastic or metal), distribution of materials (e.g., plastic sides with metal top), texture, 
color, color combination, length, width, thickness, size of controls, shape of control, color 
of controls, position of controls, position of status lights, speaker grill pattern, etc. 
10 Designs for a billboard would have attributes such as dimension, aspect ratio, dominant 
color, background color, color scheme, size of print, presence or absence of pictorial 
material, various types of content for pictorial material, number of people in a scene, site 
h of the scene (big city, pastoral setting, domestic setting, dance hall), etc. 

[0033] The term "attribute 55 denotes both elements that are absolute, in the sense that they 

y s 

W 1 5 are either present in the product or not, and relative, in the sense that an attribute can have 

yi many values, or be broken down into many subtypes. In this respect, the meaning of 

m "attribute" as used herein is broader, and distinct from the term as used in the conjoint 

H analysis literature. An example of the former is the presence or absence of a clock in an 

h auto dashboard design or a collar on a dress design. An example of the latter is the radius 

.;S 20 or other measure of the degree of curvature on the bow of a boat hull design, or the 
^ reflectivity of the glass covering a building. 

[0034] Broadly, the invention involves generating and presenting, typically electronically, 
a number of design alternatives to persons who are participating in the design, selection, 
or market research exercise. The participants (referred to as "selectors") transmit data 
25 indicative of their preferences among or between the presented design alternatives, and 
that data is used to derive a new generation of design alternatives or proposals. The new 
designs are generated through the use of a computer program exploiting a genetic or 
evolutionary computational technique. The process is repeated, typically for many 
iterations or cycles. Depending on the purpose of the effort and how the method is 
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designed and run, it can be used in a number of new and useful ways. It can serve to 
design new products or services that are appealing to individual consumers or a targeted 
group of consumer, to facilitate group design efforts, to conduct market research in a 
better way than previously possible, e.g., probing the affinity of individual consumers, 
demographically defined groups of consumers, or consumers with a particular state of 
mind, for a given product or service. It can also be used to design a product or service 
that will appeal to a participating group, or to serve as a virtual salesperson, effectively 
facilitating a shopper's choice of what to buy. Stated differently, the invention permits an 
individual shopper to quickly make a rational selection of a product from a potentially 
vast number of similar products having various combinations of features and attributes. 
One advantage of the proposed invention is that the participants assess several design 
candidates over a number of successive iterations. This is particularly helpful in those 
design situations that involve novel or unusual styles, as is the case with apparel and 
automobile styling, to name two examples, where the initial exposure to such an unusual 
design may elicit initial reactions that are inaccurate.. 

[0035] The invention may exploit various ways to gather data indicative of preference 
and various ways to tabulate, filter or aggregate, and use that data. Thus, data obtained 
from a subset of the persons comprising the selector may be given a disproportionate 
influence on the generation of the derived group of product forms, i.e., discounted, 
elevated in importance, or ignored. The selector may be permitted to specify an attribute 
of said product before or during the iterations of derived groups. This may involve fixing 
the value of that attribute at a particular value, or preventing that attribute from taking on 
particular values that the participant finds undesirable. Before beginning the iterative 
selection/design process, the system may obtain certain preference information from the 
selector and may use at least a portion of the information obtained in such prescreening to 
constrain the subsequent generation of derived product forms. For example, such 
information may include the range of prices the seller is willing to pay for the product, 
selector body size information, product style information, color preference, material 
preference, a performance specification, or a list of selector desired product functions. 
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[0036] In one preferred embodiment, the method comprises the additional step of 
effecting a sale to the selector or a subset thereof (one or a group of persons comprising 
the selector) of a product based on a selected product form. The product may be 
produced for delivery to the selector by mass customization, i.e., by an organization 
dedicated to producing upon request any one of a large variety of forms of the product as 
dictated by a particular purchaser (such as bicycles, footwear, or clothing). Alternatively 
the product based on the selected product form may be a product which exists prior to its 
identification by the selector, e.g., is sold from an inventory. Practice of this aspect of the 
invention preferably involves a program that presents derived groups of product forms 
that a seller has available in inventory or can easily or profitably be made or purchased. 
Thus, in this aspect, the method functions as a virtual salesperson, one that discerns the 
selector's preferences and suggests alternatives that may be appealing to the selector 
based on the incoming captured data, and subtly influences the selector's choices by the 
sequence or selection of presented product forms. 

[0037] Viewed from another perspective, the invention comprises a computer-aided 
bridge between incompatible constituent elements of the language of the science of 
design, on the one hand, and the cognitive language and thought processes employed by 
consumers when they consider their preferences or consider a purchase. It is this 
dichotomy which heretofore has inhibited effective consumer input to design tasks, input 
that is truly reflective of their preferences. The design engine and virtual salesperson 
embodiments of the invention described herein essentially comprises a computer- 
mediated translation device, converting seamlessly and effectively the preferences of 
consumers, which often defy verbal description, into design-specific data specified 
through variables useful in implementing design. By allowing a consumer to evaluate an 
evolving set of whole designs, each of which incorporates aspects relative to that 
consumer's preference determination, the consumer is permitted to drive directly the 
design or product selection process without being familiar with specific design attributes 
or language. For instance, the curvature of an arm on an easy chair may be an attribute 
that affects the look of the char and the subjective aesthetic assessment of a consumer, but 
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often cannot be specified by a consumer ignorant of variables affecting chair design. The 
reason many people may say that they cannot specify what they like until they see it may 
be because consumers generally are untrained in the language of design. The "design 
engine" (defined later) seeks to overcome this underlying constraint as set forth herein. 
5 Designers who may not be knowledgeable about how a consumer actually evaluates a 
particular design can use the invention without being significantly disadvantaged. 
Similarly, consumers ignorant of design theory and principle can achieve a design that 
they like and that has a good chance to endure as a favorite. 

[0038] As noted generally above the preferred apparatus for implementing the methods of 
10 the invention comprise a network wherein the program resides in a server which is linked 
to plural terminals. The terminals employed in the apparatus may comprise a computer, a 
television, a telephone, a personal digital assistant, or other electronic device coupled 

fj wirelessly or via wires to a server. The apparatus most typically comprises a plurality of 

rl 

rS terminals. Of course, given the current state of the information technology art, other 

W 1 5 system architectures may be used to embody the system of the invention for 

yi implementing its various methodologies. 

[0039] The method may involve iterating the cycle of selection and derived product form 
f* generation a sufficient number of times to permit determination of one or a plurality of 

C3 product forms preferred by the selector. Particularly in a group design effort where the 

20 selector is a group of consumers, this may lead to the identification of more than one 
preferred design. Collecting demographic data about the selector and correlating the 
product forms preferred by the selector to the demographic data permits identification of 
market segments which may be exploited using differing strategies. Accordingly, the 
invention facilitates a new form of market research, in which its proprietor is enabled to 
25 discern the relative affinity of a consumer or group of consumers for a given product 
form, or to discern market segments, for example, early adapters, late majority, etc. 
[0040] The derived group of product forms next are presented to one or more persons 
comprising the selectors, who again input data indicative of their preferences, this time 
with respect to the new set of product forms, and the process is repeated until a stopping 
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criterion is met. The stopping criterion may be, for example, a decision to purchase made 
by the selector, the cycling of a predetermined number of iterations, the reaching of a 
consensus agreement on attributes by a plurality of persons comprising the selector; the 
participation of a predetermined number of persons comprising the selector; the 
achievement of a predetermined number of assessments, the passage of a predetermined 
time for conducting the exercise, the arrival of a point in time in the future, the 
intervention of a supervisor such as a person who judges that a good design has been 
achieved, the lack of improvement in emerging product forms as judged by a person 
comprising the selector or a supervisor, or a suitably programmed computer; the selection 
of a specific product form by a person comprising the selector, the convergence of all 
design alternatives generated by the evolutionary algorithm to a small enough number of 
possibilities (i.e., the loss of genetic diversity or the arrival of a certain level of similarity 
in the population of designs), or some combination thereof. 

[0041] Persons participating in the exercise making up the selectors will of course have 
preference profiles which may well evolve during a design cycle. The participant may be 
influenced by peer choice in group dynamics. Also, his or her preferences may be 
adjusted because he or she sees and thinks about alternatives in a more rigorous way then 
may otherwise be the case. Perhaps most significantly, participation in a design exercise 
by a person may well serve to increase that participant's confidence level in providing 
evaluations. Often, early generations of product alternatives may be fraught with low 
confidence evaluations. However, during the evolutionary design process, as the 
consumer's preferences are increasingly reflected in the design attributes, the consumer's 
own evaluations may well be made with a greater confidence. A similar phenomenon is 
that some consumers make purchase decisions more confidently if they have researched a 
product. Furthermore, inclusion of the consumer's design through repeated steps and the 
concentrated thinking about what really is his preference may well lead to a higher 
frequency of purchases than otherwise might be the case. Based on these behavioral 
insights, in accordance with the invention, in some embodiments it may be valuable to 
permit participants to input data indicative of the confidence they have in their preference 
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at least at some points in the iterative process. The level of confidence in a design as 
expressed by a participant can be used as a cycle stop criterion, at least with respect to a 
particular participant. 

[0042] The invention contemplates the use of a wide variety of programming techniques 
to aid in the achievement of the goals of a given exercise. Generally, many known 
computational techniques can be exploited in the design of computer programs useful in 
the methods and apparatus of the invention, and they can be adapted by the skilled 
programmer to achieve a given purpose. The preferred techniques are genetic or 
evolutionary computation techniques (GEC's) such as genetic algorithms, evolution 
strategies, distribution estimation algorithms, and genetic programming; other 
computational techniques the use of which is contemplated in the present invention 
include generative grammars, hill-climbing, simulated annealing, random search, a 
generator of random attribute values, statistical design of experiments techniques, or a 
combination thereof. Conjoint analysis techniques also may be used, e.g., in weighing of 
attributes of product forms derived from the expressed preferences. When this type of 
analysis is used in combination with genetic or evolutionary computation techniques it is 
possible to decrease the number of iterations needed in a given exercise to obtain the 
desired information. 

[0043] The program may execute a genetic algorithm operation, an evolution strategy 
operation, a genetic programming operation, a conjoint analysis operation, a generative 
grammar operation, a generator of random attributes operation, or any other to generate a 
derived group of product forms. The program may select from a set of product attributes 
to assemble a derived set of product forms and/or may exploit a function which can 
generate new or modified attributes. The program also may permit a selector to delete a 
generated product form, to introduce a new product form within a derived group of 
product forms, to impose a constraint on the generation of a derived group to those forms 
comprising a preselected attribute or attribute value, or to those not comprising such 
particular attribute or attribute value, or to specify an attribute of the product or other 
object of the exercise. The apparatus may further comprising means for storing a 
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plurality of product forms preferred by a selector and electronic means for effecting a sale 
to a selector of a product form she selected. 

[0044] These various computational techniques are not per se considered an aspect of the 
invention, except insofar as they are used in combination with other process steps as set 
5 forth herein or as may be set forth in some of the appended claims. The invention also 
includes systems utilizing multiple levels of genetic or evolutionary computation 
techniques where, for example, the output of a first algorithm is used as the input of the 
next. The computer programs may embody various acceleration strategies, i.e., code 
implementing logic that reduces the participants' voting load, for example by using 

10 adaptive statistical models of the participant to evaluate some of the designs, or code that 
may reduce the number of design cycles needed to discern adequate or optimal forms by 
seeding the product form populations with "good" designs, by evolving higher-level 
modules first in the case of designs that are modular in nature, or by the use of various 
constraint parameters to reduce or eliminate impractical or impossible designs. 

15 [0045] The method broadly comprises the steps of presenting, e.g., through a computer 
display or output device of some type, to the selector a group of product forms, each of 
which has a particular combination of attributes. The way these initial product forms are 
designed or chosen is not critical, but may involve screening of candidate designs to 
reflect previously articulated preferences of the selector or a supervisor. The presentation 

20 typically is made electronically, e.g., by presenting graphical, alpha numeric or other 
visual data representative of the design alternatives or forms. Visual sensing of the 
presentation is not a requirement of the invention as the product being designed or 
selected may by an audible product sensed aurally such as a tune or a jingle. Attributes of 
the product may be sensed tactilely to discriminate among or between smoothness, 

25 texture, temperature, ergonomic curvature or softness, or degrees thereof. It is also 
possible to employ the methods and apparatus of the invention to design or select 
fragrances sensed nasally and tastes sensed orally or orally/nasally. 
[0046] Next, the methods of the invention have the selector express a preference for a 
subset (one or more) of the presented product forms, and data indicative of the preference 
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expressed by the selector is captured for use in evolving design alternatives. As disclosed 
herein, a variety of voting schemes may be used, with the selection of the protocol for 
gathering, aggregating, screening, or otherwise conditioning the data being dependent on 
the goals of the exercise. The captured data is entered into a program for generating a 
derived group, or "next generation" of product forms. These including product forms 
having either or both a new attribute (e.g., a new color or a new shape for a part or 
component of a product design), attributes with new values, or a new combination of 
attributes. As noted above, the program exploits various known or as yet to be developed 
approaches, strategies, data treatment methods, and algorithms to generate the derived 
group or next generation. The important aspect of the program in accordance with the 
invention is that the captured data influences the construction of the derived forms. 
[0047] The program may select from a set of product attributes to generate at least a 
portion of a given derived set of product forms, or may exploit a function which creates 
or modifies an attribute. The program also may permit or encourage a selector or a third 
party, e.g., the proprietor or supervisor of the system, to delete a particular generated 
product form or to introduce a new product form at any point in the cycle. Also, the 
program may permit a third party or the selector to constrain generation of a derived 
group to those comprising (or, alternatively, not comprising) some preselected attribute 
(or attribute value) so as to enrich (or alternatively deplete) the population of derived 
product forms with that attribute, i.e., may be responsive to boundary conditions set by 
the selector or a supervisor controlling the system. 

[0048] Adaptation of these computation techniques (or as disclosed below, voting 
techniques) for a given goal involves, for example, in the case of the market research 
embodiment, controlling the algorithm/program so that the participants (typically a large 
number of consumers on line) are provided through the computer program with a variety 
of product forms in successive generations which are designed specifically to present 
eclectic, widely varying design alternatives so as to promote exploration of the design 
space having diverse combinations of product attributes. Alternatively, or in addition, the 
computer program generates derived product forms which converge on a set of product 
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attributes matching the preference of one or a subset of consumers, i.e., evolving toward a 
"fit product" - one that best matches the consumer's preferences. In still another aspect, 
the computer program generates derived product forms which converge on a plurality of 
forms of products having sets of product attributes matching the preferences of a 
corresponding plurality of subsets of consumers. Thus, the system can permit 
identification both of groups of consumers with similar preferences and designs which 
satisfy that preference. 

[00491 In yet another preferred embodiment, referred to herein as the "virtual 
salesperson" the invention provides a method for promoting selection by a shopper of a 
product from among a large number of similar product forms having alternative 
attributes. The method comprises the steps of presenting electronically to a shopper a 
group of product forms, each of which has a particular combination of attributes, enabling 
the shopper to express a preference for a subset of the presented product forms he or she 
prefers, capturing data indicative of the preferences expressed by the shopper, inputting 
the data into a program for generating a derived group of product forms, including forms 
having a new attribute or new combination of attributes, the generation of which is 
influenced by the captured data, presenting to the shopper at least some of the derived 
group of product forms, and repeating the data capture, inputting, new choice generation, 
and presentation steps until a stopping criterion is met, typically a purchase decision. 
Again, the method is implemented preferably by an electronic network, most preferably in 
the Business to Business and Business to Consumer contexts via the internet. 
[0050] For embodiments in which the system serves as a "virtual salesperson," one can 
control the algorithm/program so that the participant (typically a single shopper) is 
provided through the computer program with designs (purchase options) of preexisting 
products or services, products that can be manufactured easily, or product inventories that 
are available for sale. By "preselecting" the new product alternatives in respective 
generations, the system leads the shopper to a product he or she prefers among existing, 
particularly profitable, or overstocked wares, or to the form of the product he or she finds 
most appealing. 
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[0051] In the embodiment of the invention referred to herein as a "design engine" the 
selector is a relatively large group of consumers, or persons who may or may not work for 
the same organization, be members of a common demographic group, or include 
professional designers. In the aspect of the invention concerned with facilitating the 
collection of market data, the selector typically is a group of consumers. 
[0052] In one important embodiment, the invention provides methods of collectively 
designing a product having a potentially large number of forms, each of which has 
alternative attributes. The method may be embodied in a suitably configured computer or 
network of computers which serve as a design engine. The method comprises the steps of 
presenting electronically to each of a plurality of persons a group of product forms, each 
of which has a particular combination of attributes, enabling the persons to express a 
preference for a subset of the presented product forms they prefer, capturing data 
indicative of the preferences expressed by the persons, inputting the captured data into a 
program for generating a derived group of product forms (including forms having a new 
attribute or new combination of attributes, the generation of which is influenced by the 
captured data), and presenting to a plurality of persons the derived group of product 
forms. The process steps are iterated until a stopping criterion is met, such as the 
discovery of one or a plurality of product forms preferred by said persons. Then, one may 
produce a plurality of units of a product based on a selected product form, and sell a 
product based on a selected product form to one or more of the person or to others. The 
selecting persons may be, for example, professional designers or members of a focus 
group. 

[0053] The method may involve iterating the cycle of selection and derived product form 
generation a sufficient number of times to permit determination of one or a plurality of 
product forms preferred by the selector. Particularly in a group design effort where the 
selector is a group of consumers, this may lead to the identification of more than one 
preferred design. Collecting demographic data about the selector and correlating the 
product forms preferred by the selector to the demographic data permits identification of 
market segments which may be exploited using differing strategies. Accordingly, the 
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invention facilitates a new form of market research, in which its proprietor is enabled to 
discern the relative affinity of a consumer or group of consumers for a given product 
form, or to discern market segments, for example, early adapters, late majority, etc. 
[0054] In another embodiment, the invention provides a method of reaching consensus 
among a group of participating persons, such as business managers, on a plan or menu 
having a potentially large number of alternative attributes. The method comprises the 
steps of presenting electronically to each of a plurality of participating persons a group of 
alternatives, each of which has a particular combination of attributes. The persons 
express a preference for a subset of the presented alternatives. Data indicative of the 
preferences expressed by those persons are entered into a program for generating a 
derived group of alternatives, including plans having a new attribute or a new 
combination of attributes. The generation of the new, derived alternatives is influenced 
by the captured data, and the derived group of plan alternatives then is presented to a 
plurality of participating persons. The data gathering, new plan generation, and 
presentations are repeated until a consensus is achieved. This process may be 
implemented on an intranet as groupware, or on the internet. The generated alternatives 
may be plans that are preferred by a supervisor which are presented to the participating 
persons so as to induce them to choose attributes of a supervisor-preferred plan. Again, 
the method may involve the additional step of constraining generation of a derived group 
of alternatives to those comprising a preselected attribute or set of attributes. 
Brief Description of the Drawings 

[0055] The advantages of the invention described above, together with further 
advantages, may be better understood by referring to the following description taken in 
conjunction with the accompanying drawings. In the drawings, like reference characters 
generally refer to the same parts throughout the different views. Also, the drawings are 
not necessarily to scale, the emphasis instead is placed on conveying the concepts of the 
invention. 

FIG. 1 is a block diagram illustrating one embodiment of the system for 
performing the invention. 
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FIG. 2 is a flowchart showing one embodiment of the process steps of decision 
making or design exercises conducted in accordance with the invention. 

FIG. 3 is a flowchart showing one embodiment of the steps to be taken in an 
exercise involving a multipurpose selector entity with purchase decisions as the outcome. 

FIG. 4 is a flowchart showing one embodiment of the steps to be taken to identify 
market segments in an evolutionary design exercise. 

FIG. 4A is a flow diagram depicting one embodiment of evolutionary algorithm 
featuring speciation and niching. 

FIG. 4B is a flow diagram depicting one embodiment of the steps to be taken to 
compute mating probabilities. 

FIG. 4C is a flow diagram depicting one embodiment of the steps to be taken to 
compute and entities niching discount. 

FIG. 5 is a screenshot depicting one embodiment of a registration page useful in 
connection with the invention. 

FIG. 6 is a screenshot depicting one embodiment of a dialogue screen useful in 
connection with the invention. 

FIG. 7 is a screenshot depicting one embodiment of a screen useful for receiving 
user input. 

FIG. 7 A is a screenshot depicting a particular preference assessment prior to vote 
submission 

FIG. 7B is a screenshot depicting one embodiment of a second voting screen 
following vote submission. 

FIG. 7C is a screenshot showing an embodiment of a voting screen featuring a 
"pick panel" and a "progress bar." 

FIG. 8 is a screenshot depicting an embodiment of a display of items based on 
their R-space representation. 

FIG. 9 is a screenshot depicting an embodiment of a display of items based on 
their feature representation. 
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FIGs. 10 and 1 1 are screenshots showing the items presented to participants 
during one embodiment of a design exercise. 

FIG. 12 is a screenshot depicting one embodiment of an R-space plot. 

FIG. 13 and 14 are screenshots depicting the distribution of feature genes in one 
5 embodiment of a design exercise. 
Detailed Description of the Invention 

[0056] FIG. 1 shows one embodiment of an environment in which the present invention 
may be used. Selectors may use one or more client systems 10, 20, 30, 40 to 
communicate with one or more server computing systems 50, 52, 54 over a network 100. 

10 The network 100 can be a local-area network (LAN) such as an Ethernet network or a 
wide area network (WAN) such as the Internet or the World Wide Web. Client systems 
10, 20, 30, 40 can be connected to the network 100 through a variety of connections 
including standard telephone lines, LAN or WAN links (e.g., Tl, T3, 56kb, X.25), 
broadband connections (ISDN, Frame Relay, ATM), and wireless connections. The 

15 connections can be established using a variety of communication protocols (e.g., TCP/IP, 
IPX, SPX, NetBIOS, Ethernet, RS232, and direct asynchronous connections). For 
example, the network 100 may be a corporate intranet connecting decisionmakers in an 
organization to a centralized decision engine, or it may be a secure extranet or virtual 
private network connecting different entities such as a company's suppliers or consultants 

20 to the company's design engine. 

[0057] As shown in FIG. 1, client systems 10, 20 may be client computing systems 
typically used by a user, such as any personal computer (e.g., 286-based, 386-based, 486- 
based, Pentium-based, iTanium-based, Power PC-based), Windows-based terminal, 
Network Computer, wireless device, information appliance, X-device, workstation, mini 

25 computer, mainframe computer, personal digital assistant, or other computing device. In 
these embodiments, client systems 10, 20 may use any one of a number of windows- 
oriented operating systems such as Windows 3.x, Windows 95, Windows 98, Windows 
NT 3.51, Windows NT 4.0, Windows CE, Macintosh, Java, Unix, and Linux. In this 
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embodiment, the selector comprises the user 12, 22 interacting with the system via the 
client devices 10, 20. 

[0058] In other embodiments, a client system 40 is an information kiosk located in a 
retail establishment. In these embodiments, the client nodes 40 may include a touch- 
sensitive screen or membrane keyboard for receiving consumer input. In other 
embodiments, the client system 40 is a retail point-of-sale terminals that collects 
consumer reference information from sale transactions. Client system 30 in FIG. 1 
depicts an embodiment of a selector that is a proxy for a real person, such as a computer 
programmed and trained as a neural net, a statistical model, a distribution estimation 
algorithm, a reinforcement or Q learning method, a learning classifier system, or other 
machine learning methods or expert systems. In these embodiments, client system 30 
may be one or more processes (threaded or otherwise) that implement evaluative models 
or algorithms, such as neural net models, learning classifier system, statistical models, or 
an expert system, which emulate the voting preferences of a human and which vote by 
proxy. These processes may execute on client system 30 and communicate with server 
systems 50, 52, 54 via network 100. Alternatively, the client system 30 may execute on 
the server systems 50, 52, 54 and communicate with various server processes using pipes, 
shared memory, or message-based communication such as remote procedure calls. 
[0059] In many embodiments, one of the servers 50, 52, 54 is responsible for presenting 
to selectors the initial population of product forms, generating the derived product forms 
to be presented to the selector, and capturing and processing the data that is indicative of 
the selector's preference. This server is referred to as the "presentation server." An 
attribute database 60 stores the possible attributes available for generating product forms. 
A voting database 70 stores the preference data obtained from the selector during the 
course of the process. In some embodiments a single database is used to store both the 
possible product attributes as well as obtained preference data. 
[0060] Another of the servers 50, 52, 54 implements generative and evolutionary 
computation programs that utilize the stored attribute data and the stored preference data 
to generate representations of the product forms. This server is referred to as the 
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"generate server." The presentation server processes these product form representations 
to generate product forms that can be presented to the selector. 

[0061] Yet another of the servers 50, 52, 54 serves as a vote aggregation analysis server. 
This server plays several roles: it captures the preference data coming from the selector 
and stores it in the voting database 70; it also analyzes the data and transforms or 
conditions it into a format that can be used by the generate server; it is also used to 
develop models, such as statistical or neural net based models or other machine learning 
models of the selector preferences, and may use these models to eliminate some of the 
forms generated by the generate server prior to presenting them to the selector. 
Additionally, it may provide data indicative of the preference of subsets of the selector, 
which may be appended to the presented forms by presentation server. Although depicted 
as separate servers, the generate server, presentation server, and vote aggregation/analysis 
server may be embodied as any number of physical servers. 
[0062] For embodiments in which the invention allows for or exploits a purchase 
decision by the selector or subset thereof, one of the servers 50, 52, 54 may be an e- 
commerce server. For example, a purchase decision may provide one of the stopping 
conditions for a design exercise, or individuals comprising the selector may be permitted 
to place a purchase order for one of the intermediate product forms that they find 
satisfactory. The e-commerce server, which is well understood by those skilled in the art, 
uses a database containing customer information such as billing information and shipping 
address. The e-commerce server may be used to obtain the relevant billing and shipping 
information from the client, process it, store it in the database, and forward the relevant 
data to the order fulfillment entity. 

[0063] The selector also may comprise one or more computers programmed as a 
statistical model, neural net, learning classifier system, other machine learning models, or 
with other appropriate software algorithms "trained" to mimic or simulate a consumer's 
preference pattern. Such a surrogate selector can, among other things, facilitate the 
feedback and evaluation process during a computer-driven emergent design cycle. A 
suitable computer program can facilitate or even eliminate the consumer's participation 
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except perhaps as a supervisor. For instance, after going through a training phase, an 
evaluation program may express a suggested preference pattern (evaluation) for a given 
set of alternatives for the consumer to accept or adjust before submission as input to the 
generation program. After repeated cycles, the consumer may allow his or her personal 
evaluation program to provide unsupervised input to the generation program for several 
cycles before pausing to allow the consumer to make adjustments. Ultimately a sense of 
trust may develop between the consumer and the evaluation program that allows the 
evaluation program to act as a proxy for the consumer. An advantage of such a method is 
that the evaluation program-generation program can interact for several cycles starting 
from many initial seed evaluation sets (alternatives) in order to scout more fully the 
fitness landscape between the consumer preferences and particular design alternative. 
[0064] The neural net, learning classifier system, machine learning system, expert 
system, or other type of evaluation programs can be trained using a set of emergent design 
cycles with computer generated alternatives and consumer specified evaluations. The 
prospect of having a personalized evaluation program available to assist in the future 
interactions with the emergent design process may be an inducement to the consumer to 
engage in a large set of design cycles. 

[0065] In other embodiments of the invention, the selector consists of a single individual, 
i.e., the system is a single-user system. In this case, there are no multiple votes to be 
aggregated and analyzed. Therefore, the voting database 70 is used to store the 
preference data throughout the design exercise or decision making process for the 
particular selector. The voting database 70 may also contain preference data from other 
selectors who may have participated in similar exercises previous to the current one, 
including data from exercises in which the same selector may have participated 
previously. One of the servers 50, 52, 54 collects, analyzes and stores the incoming 
preference data from the selector; it may also be used to provide feedback to the 
participant by providing data to the presentation server, which is indicative of the 
evolution of the selector's preferences over the duration of the exercise, or which may 
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provide the selector with a basis for comparing present preferences to data stored in the 
voting database 70. 

[0066] This embodiment may be used to implement the virtual salesperson embodiment 
of the invention. It may also be used to implement a one person design exercise via an 
application service provider model. Of course, such a system alternatively could be 
embodied in a single, suitably programmed computer. 

[0067] FIG. 2 is a process flow diagram for an exemplary decision making or product 
design exercise, embodying a method of the invention. 

[0068] In this example, the process starts with identifying the object of the exercise, that 
is, the decision object or the design object, represented by block 210. At this point, the 
object is identified in very general terms, such as, "the colors of a tennis shoe," "next 
week's meeting agenda," "the menu for next month's association meeting." In some 
embodiments, the step of identifying the object of the exercise 510 is skipped such as in 
the "virtual salesperson" embodiment. Next, in step 211, those attributes of the object 
that will be permitted to change during the exercise are identified, and the different values 
that they will be permitted to take on are determined. For example, in the case of the 
colors of the tennis shoe mentioned above, step 21 1 may involve identifying the 
individual elements of the shoe which are subject to design variation; the result may be: 
the vamp color, the eye stay color, the tongue color, the heel color, the sole color, and the 
laces color. Furthermore, the range of colors that each of these elements may take are 
established. For example, the laces may have three different colors they can take on, e.g., 
white, black, and red, or there may be four shades of red, or red attributes with different 
values. Similarly, there may be eight colors that the vamp may take, e.g., red, blue, white, 
green, orange, black, yellow, and purple. Furthermore, in other embodiments, certain 
constraint rules may be implemented that prevent, for example, a certain color of laces to 
be used with a particular color of the tongue. In other embodiments, an attribute may 
have a continuous range of values. 

[0069] The next step in the process, represented by block 212, involves determining the 
representation or genotypic coding that will be used to represent the particular design or 
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decision object internally, in the genetic algorithm, genetic program, or other GEC 
program. In the case where a genetic algorithm, the "genotype" is a data structure that 
encodes each attribute value, such that a particular instance, combination of attributes, or 
"value" of that structure represents one particular product form. It is directly tied to the 
previous step 21 1, in which the attributes and their possible values are decided, 
sometimes called "featurization." Continuing with the example of the tennis shoe colors, 
an appropriate genotype might consist of six integers strung together, each of which can 
be thought of as a gene representing one of the identified features such as the laces color. 
That integer in turn would be limited to taking on distinct integer values, here, as an 
example, three, say 0, 1, and 2, each of which is used to represent one of the three 
predetermined, allowable colors for the laces. This example genotype structure is shown 
schematically below. 



Integer 1 


Integer 2 


Integer 3 


Integer 4 


Integer 5 


Integer 6 


Represents 
vamp color 


Represents 
eye stay 


Represents 
tongue color 


Represents 
heel color 


Represents 
sole color 


Represents 
laces color 



color 



Range: 0-7 ... ... ... ... Range: 0-2 

[0070] In another example, it may be that the values that a gene can take on are not 
indices for predefined attributes, but rather represent a physical parameter. For example, 
if one of the design parameters identified in 21 1 was the height of the heel of the shoe, 
then a gene coding it might be a real number allowed to take on values, for example, 
between 0.5 and 1 .5, where the number represents the actual height in inches. In another 
case, the integers or real values by the genotype may represent parameters that are used in 
a complex computer-aided design program that generates different forms based on these 
parameters; for example, the parameters may represent the dimensions and radii of 
curvature of certain shapes, and/or the parameters of a Bezier curves that make up the 
shape. 
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[0071] In one embodiment, products may be described as models in a CAD/CAM system, 
and design features may be extracted from the CAD/CAM model of the product 
automatically by the CAD/CAM system. For example, a product may be represented in a 
CAD/CAM system by a table linking model attributes and the specific value of an 
attribute. The model attributes may be thought of as the respective "genes" for a product 
and the specific values of the variables as the "chromosome" values or specific "alleles". 
The attribute values can be manipulated by making API calls to the CAD/CAM system. 
[0072] The next step in the flowchart, represented by block 213, involves determining a 
mapping or transformation from the genotype representation described above, a data 
structure internal to the evolutionary algorithm, to the phenotype which is the 
corresponding form representation that can be presented to the selector. In the example 
of the tennis shoe color, this mapping is trivial, as there is a simple correspondence 
between a particular feature color and its index value. In other cases, this mapping may 
be more complex. For example, in a case where genetic programming is used, the 
genotype may encode a program or set of instructions that generate a product form, say a 
geometric shape, or determining the parametric computer aided design model of a shape, 
the parameters of which are encoded by the genome. 

[0073] The preceding steps, 210 through 213, are preparatory steps for the iterative part 
of the process, which begins at 214. In 215, an initial population of possible solutions — 
possible designs, possible decisions, possible menus — is generated. In the language of 
Genetic and Evolutionary Computation, this initial population is often referred to as a 
seed population or trial population. Typical population sizes may range from 2 to 
100,000. In some embodiments, typical population sizes range from 3 to 50,000. In 
more preferred embodiments, typical population sizes range from 4 to 10,000. In still 
more preferred embodiments, typical population sizes range from 5 to 1,000. In a most 
preferred embodiment, the typical population size ranges from less than 50 to 600. Each 
member of the population is an instance of the genotype described earlier, that is, a data 
structure where each field or "gene" takes on one of its allowable values; these are also 
referred to as chromosomes. The seed population may be generated by picking random 
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values from the allowable ranges for each field in the chromosome. Other possible ways 
to populate the initial set of possible solutions is to use chromosomes that are the result of 
a previous exercise, ones that represent product forms designed by people using other 
(more traditional) means, or among other methods which depend in part on the goals of 
the process. 

[0074] Once the initial population is generated, it is presented to the selector for 
evaluation. This step is represented by 216. Presenting the possible solutions may 
require using the genotype to phenotype transformation scheme that was determined in 
213. Step 216 may involve presenting the whole population of possible solutions to the 
selector, or it might involve presenting a subset of that population. In some particular 
embodiments, the selector is presented with subsets of, or "windows" onto, the global (in 
this case, the initial) population. For embodiments involving CAD/CAM systems, step 
216 requires the CAD/CAM system to render the respective members of the population. 
At a minimum, two of the possible solutions are presented to the selector. At 216, along 
with the presentation of the product forms, the selector also is presented with means for 
expressing a preference among them. This can be implemented in any number of ways, 
from clicking on the ones that are deemed good, to moving the assigning grades to the 
various forms presented, ordering the forms by order of preference, and so on. Each of 
these methods results in particular types of preference data that is captured and used in 
the next step. In particular, one way a selector may indicate his of her preference is by 
issuing a purchase request for one of the forms presented, one that is presumably deemed 
satisfactory. 

[0075] In step 217, the preference data from the selector is collected and analyzed 
typically electronically. In the case where more than one individual comprise the 
selector, the preference data from the different individuals must be aggregated and 
conditioned to make it usable in the subsequent steps of the process. In this step, any 
number of vote aggregation methods may be used. It should be noted that the vote 
aggregation method and the method provided to the selector to express their preference 
are technically related. 
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[0076] The voting systems useful in the practice of the invention may be generally 
classified into five categories: (1) majority rule and majoritarian methods; (2) positional 
methods; (3) multi-stage methods; (4) utilitarian methods; (5) and proportional methods. 
[0077] Majority Rule and majoritarian methods rely only on information from binary 
comparisons between alternatives. Perhaps the most familiar example of a majority rule 
is the presidential election process in the United States, which is often a choice between 
two candidates. The winner of a majority rule election scheme is the alternative (or 
candidate) preferred by more than half of the voters. For cases in which more than two 
alternatives are presented, then some other procedure, such runoff elections, are needed to 
whittle the number of alternatives down to two (or to group the alternatives into two 
groups). Simple majority rule can be applied to more than two alternatives by performing 
pairwise comparisons and eliminating the alternatives that lose out in these comparisons. 
In this method, the winner may depend on the order in which the pairwise comparisons 
are performed. Other majoritarian systems include the Amendment Procedure, the 
Successive Procedure, The Copeland Rule (which uses pairwise comparisons and counts 
losses as well as wins), and the Schwartz rule, among others. 
[0078] Positional Methods utilize more information about voters' preference ordering 
than majoritarian methods (but not the whole ordering necessarily.) In plurality voting 
(also known as first past the post) every voter votes for his or her most preferred n 
alternatives, where n is the number of candidates to be elected. The alternatives with the 
most votes win. Unlike majoritarian methods, due to vote splitting in plurality voting, it 
is possible for two similar candidates both to lose to a third candidate that is different 
enough, even though it is less preferred by the overall electorate. Positional methods are 
particularly relevant to several preferred embodiments of the present invention, as these 
involve presenting a number of alternatives to the participants in the exercise, and asking 
them to rank the alternatives by order of preference. 

[0079] In Approval voting, voters can pick as many of the alternatives as they wish (all 
the ones they "approve of." The winner is generally the alternative that receives a 
plurality of votes (more votes than the others). 
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[0080] In Borda Count voting, which is an example of a "scoring" or "point" method, 
each voter gives a number of points to each alternative, as follows: the most preferred of 
the n alternatives is given n-1 points, the next most preferred is given n-2 points, all the 
way to n-n or 0 points to the least preferred alternative. The winning alternative is the 
one that receives the most votes. 

[0081] Multi-Stage Methods use different functions or mechanisms at different stages of 
the voting process; they may also use the same mechanism iteratively on a decreasing 
number of alternatives. One advantage of these methods is that a voter need not fear 
wasting his or her vote if they choose an alternative that is unlikely to win. One such 
method is Black's method, which selects the Condorcet winner if one exists (through 
successive pairwise comparisons); if a Condorcet winner cannot be found, it selects the 
Borda count winner. Another multi-stage method is the runoff procedure, briefly 
mentioned earlier, in which, absent a majority winner in the first round of balloting, a 
runoff simple majority election is held between the two alternatives that received the most 
votes. Another multi-stage method is Nanson f s Borda-Elimination procedure, which 
applies the Borda method successively, eliminating the lowest scoring alternative at each 
round, until the winner remains. An advantage of this approach is that, unlike the regular 
Borda method, it will never pick the Condorcet loser. 

[0082] Single-Transferable Voting (SVT) or Hare's procedure is popular in England. In 
this method, voters submit their preference ranking over all candidates. Any candidate 
who receives more then a threshold number of first places is elected. If the elected 
candidates receive more votes than are necessary for election, the excess votes they have 
received are redistributed over the remaining candidates based on the second-choice 
preferences of the voters. And again, any voter who receives more than the necessary 
number of votes, following the redistribution of the excess votes, is elected, and a new 
round of redistribution is carried out. If no more excess winning votes are available, and 
the necessary number of winners has not been reached, the lowest scoring candidate is 
eliminated and the votes for that candidates are redistributed. 
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[0083] There are many variations on the SVT procedure, depending on the threshold 
needed to win, depending on the procedure used to redistribute the freed votes, and 
depending on the method used to resolve ties. One method for redistributing the votes 
involves "controlled randomness." SVT can be used to elect only one alternative, in 
5 which case redistribution involves votes from eliminated candidates only. (This method 
is also known as Alternate Vote or Majority Preference.) 

[0084] Coomb's procedure is similar to SVT, except that the alternatives that garner the 
most last places are eliminated (and their votes redistributed.) Whereas STV tends to 
select the most intensely liked alternative, Coomb's procedure tends to select the 

10 alternative that is least disliked by the majority, 

[0085] Utilitarian Methods. Unlike the methods discussed so far, which only required 
the voter to provide an ordinal ranking of the alternatives, Utilitarian methods require a 
cardinal rating. The voters are asked to assign utility values to each of the alternatives 
presented to them. These utility values are intended to reflect the amount of happiness or 

15 satisfaction the voters expect to derive from each alternative, using a finite scale 

(commonly used scales are those that go from 1 to 5, 1 to 7, 0 to 10, or from 0 to 100.) A 
distinction should be made between interval scales, and ratio scales; in the former, the 
zero has no meaning, and it is only the difference between values that is meaningful; in 
the latter, the zero does mean absence of the characteristic that is being measured. The 

20 outcomes in utilitarian methods is based on the aggregation of the utility values given by 
the voters for the various alternatives. 

[0086] To farther clarify the difference between the methods presented so far and the 
present utilitarian methods, it has been noted that majoritarian methods base decisions on 
how many times x is ahead of one other alternative. Positional methods base decisions on 
25 how many times x is ahead of all other alternatives. Neither of these methods bases 
decision on the voters' direct valuation of the alternatives (although positional methods 
are sometimes mistakenly so interpreted.) Utilitarian methods account for the intensity of 
judgment, that is, for how much an alternative x is ahead of another alternative y. The 
following methods are described by Riker and Mueller. 
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[0087] Direct Aggregation of Cardinal Utility, such as may be used in one of the 
preferred embodiments described herein, is the simple Summation of Cardinal Utility 
method. In this method, the utility values for each alternative are added and the 
alternative receiving the largest sum wins. Another method involves the multiplication of 
utilities, where the utility values are multiplied instead of being summed. Variations on 
these methods involve normalizing the utility values before using them (by fitting them to 
some normal scale). One problem with these voting methods is the tendency voters have 
to inflate the utility value they assign to their favorite alternative (to increase its chances 
of winning), and to deflate the utility they assign to the alternatives they dislike. 
[0088] One variation the invention may exploit is the case where different voters or 
groups or voters are given more or less voting power than others, through the use of 
weighting factors in the summation or the multiplication of the utilities. This amounts to 
a "super voter" scheme, e.g., a manager or designer could be given more voting power 
than others constituting the selector. Note that in the ordinal voting schemes described 
before, super voter status would involve giving the super voter more than one voice. 
[0089] Demand-Revealing Methods, This method attempts to prevent the problem with 
direct aggregation discussed above, where voters inflate their valuation for their preferred 
outcome. The idea is to have voters vote by offering to pay a certain amount of money m 
in order to obtain a preferred alternative. The amounts of money offered for each 
alternative are summed, and the alternative that garners the largest sum wins. Voters 
whose offers for the winning alternative exceed the margin of victory must pay a tax 
equal to their contribution to the victory. The tax is not supposed to go to anyone 
involved in the voting system, in order not to corrupt their behavior. One downside of 
such a system is that the tax may not effectively deter those voters with a greater 
endowment of money. In the context of the invention, typically some token of value 
would be used; and the tax would consist of some form of penalty that may involve the 
tokens or something else of value to the participants in the context of the exercise. 
[0090] In one aspect the invention contemplates switching of voting scheme from one 
system to another as the design exercise progresses. As will be apparent from the list of 
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voting methods noted above, some are better on certain measures of fairness, such as 
maximizing participation, while others are better at quickly finding an alternative that 
potentially only a small part of the persons making up the selector feel strongly about. In 
other words, the voting system in use during an exercise conducted in accordance with 
the invention at any given point during the exercise may help exploration, for example, 
when exploiting the market research embodiment, or help reach rapid optimization, or 
convergence to a particular design favored by a person or set of person comprising the 
selector's representative, for example, of a particular market segment. Thus, for example, 
the invention can be practiced by switching between voting paradigms during the course 
of the exercise to help exploration early on and then drive toward a solution in a later 
stage. This general concept has been recognized as having value of certain standard 
techniques used in genetic algorithms for preventing premature convergence and allowing 
exploration early on. These have to do with the scaling of the fitness data that's fed back 
to the genetic algorithm from the evaluation function. Of course, this can be done in 
accordance with the practice of this invention. Alternatively, the selection and change of 
voting schemes are used to effectively accomplish the same thing. 
[0091] Still another aspect the invention contemplates running simple voting systems in 
parallel and, for example, comparing the output at each generation or at assigned posts 
during the course of the exercise. A decision-making scheme or rule or supervisor then 
may decide which one to use or possibly to use some combination of their outcomes to 
drive the next iteration in the process. Such a decision could be based not only on the 
current voting data at the time the assessment is made but also on the outcomes of 
different vote or data aggregations schemes and voting history or earlier iterations. Of 
course, this technique may be used by the computer program generating the derived 
product alternatives. However, again, a similar result may be achieved by running several 
voting systems in parallel 

[0092] In step 218, the preference data as well as other parameters of the exercise (such 
as the time elapsed, the number of iterations run, etc.) is tested to see whether a stopping 
condition has been met. If a stopping condition has not yet been met, the process moves 
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on to step 219. In this step, the genetic computation operations are performed on the 
population of possible solutions, in order to generate a "new" or derived population of 
solutions. The algorithms used at this stage may vary widely as noted above. In the 
preferred form, the operation is a genetic algorithm with real and integer-valued genes. 
The operators that are typical in most implementations of genetic and evolutionary 
computation include selection or reproduction operators, recombination or crossover 
operators, and mutation operators. Reproduction operators basically create copies of the 
members of the current generation of solutions as a function of their fitness. Those 
possible solutions that were preferred by the selector, that is, that were found by the 
selector to have a high degree of fitness, are more likely to be selected and reproduced 
than the ones that were found to be less desirable. It should be noted that most 
implementations of reproduction operators are not deterministic, but involve an element 
of randomness. In other words, it is the likelihood that a possible solution will be 
reproduced that varies in accordance with its fitness. It should also be noted that a highly 
fit solution may result in several copies of that solution showing up at this intermediate 
stage of reproduction. 

[0093] Another operator is the crossover operator, which acts on the intermediate 
population of solutions that is the outcome of the reproduction operation. In crossover, 
members of the intermediate population are paired, and the two chromosomes of each 
pairing are split and the different parts cross-combined, resulting in a pair of offsprings, 
i.e., new pair of possible solutions. The schematic below represents the case of single 
point crossover. 
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The resulting two chromosomes, following single point crossover between genes 2 and 3. 
[0094] It should be noted that the pairing process could be performed at random, or it 
could be based on the fitness or desirability of the different chromosomes. In addition, 
certain schemes may pair parents according to their genetic similarity or dissimilarity (we 
describe a more complex assortative mating scheme later herein.) Furthermore, decision 
to effect a crossover operation on any given pair may involve an element of randomness. 
In single point crossover, the crossover location may be determined at random as well. 
(Some of the embodiments used in the invention, and described later, result in a single 
offspring for each pair of parents.) 

[0095] A number of crossover operators have been developed by researchers and 
practitioners in the field of genetic computation; these include multipoint crossover and 
uniform crossover, each offering different performance (in terms of convergence, or the 
exploration/exploitation trade-off) under different conditions. In the case of real-valued 
genes, the crossover operator may involve both interpolation and extrapolation between 
the values of the corresponding genes in the parent chromosomes. 
[0096] Following crossover, a mutation operator is applied to the offsprings, that is, the 
results of crossover. Mutation is a random operation intended to increase the exploration 
of the space of possible solutions. The implementation depends on the particular 
representation used. In the case where a binary valued genetic algorithm is used, the 
. genotype consists of a string of 0' and l's; in that case mutation involves flipping a bit 
(from 0 to 1, or vice versa) at random, at a given probability. For example, if the 
mutation rate is 0.1%, then, on average, one in every 1000 bits encountered in the 
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population of chromosomes, one will be selected at random and flipped. In the case 
where a gene takes on an integer value, then, at the appropriate mutation probability (say, 
every one in 1000 genes on average), the integer is replaced by another one selected at 
random from the range of allowable values for that gene, or from a certain neighborhood 
of the current value of that gene. 

[0097] At that point, after all genetic computations are applied to the population of 
possible forms or solutions, a derived population is obtained, and step 219 is effectively 
complete. (There may be an additional operation applied, sometimes referred to as 
"monster killing" whereby non-allowable chromosomes that may have been generated are 
eliminated and replacements generated.) 

[0098] The derived population is now ready to be presented to the selector for evaluation 
at step 216, thus completing one iteration of the loop. 

[0099] If, at block 218, one of the stopping conditions obtains, the process proceeds to 
block 220, which represents the end of the exercise. At 220, a preferred form or several 
preferred forms 221 have been found. It is possible at this point to repeat the exercise 
with a different selector, or with the same selector but with a different initial population 
of solutions, or both. It is also possible to perform a related exercise, using different 
attributes or different attribute ranges for the same design or decision object (i.e., step 211 
is repeated to obtain different attributes, although 210 is unchanged.) This may be the 
case if a hierarchical design process is being undertaken, whereby one aspect of the 
product is designed first, then another aspect. For example, design the shape of a shoe in 
one phase, followed by choosing the color palette for it. 

[0100] The embodiment described above is referred to as a "generational evolutionary 
algorithm," where a considerable percentage of the population is replaced by offsprings. 
Steady-state evolutionary algorithms, in contrast, typically create only one or two 
offspring per iteration of the algorithm. Parents are usually chosen with a stochastic 
process that is biased in favor of more fit individuals. Once the one or two offsprings are 
made, individuals from the population must be selected for removal in order to make 
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room for the new offsprings. A great variety of removal methods exist for steady-state 
algorithms. 

[0101] For example, the individual with the worst fitness may be replaced by an 
offspring. Alternatively, the member to be replaced may be chosen using a stochastic 
process that is biased in favor of less fit individuals. Alternatively, removal may be 
effected at random, such that each individual has an equal chance of being removed. 
Crowding methods represent yet another set of replacement schemes. In these methods, 
offspring replace the most similar individual from some subset of the population. Known 
crowding methods differ on how this subset is selected and how comparisons are made. 
However, because steady-state algorithms change the population contents gradually, they 
can provide better diversity maintenance than ordinary generational algorithms. 
[0102] FIG, 3 represents the process flow for an example product design exercise with 
purchases as allowed outcomes. Blocks 310 through 316 as well as blocks 318 through 
320 are the same as described previously for FIG. 2. In this embodiment of the invention, 
after preference data is collected from the selector in block 3171, it is checked for the 
presence of any purchase requests from any member of the selector for one of the product 
forms presented in 3 16. If the preference data includes such a purchase request or 
requests, the member of the selector, along with the information identifying the selected 
product form are directed to an e-commerce server where the needed shipping and billing 
information 332 is obtained from the individual(s). The information about the selected 
product form is sent forwarded in 333 to a fulfillment center, or to a manufacturing and 
fulfillment operation 334 that is outside the described process. 
[0103] Referring now to FIG. 4, and in brief overview, a method of dynamically 
identifying a set of items for which a plurality of selectors have a similar affinity includes 
the steps of: presenting for display to a group of selectors a first group of items (step 
402); capturing data indicative of an item preference expressed by a least some of the 
group of selectors (step 404); selecting a second group of items responsive to the captured 
data (step 406); and identifying a subset of the second group of items having similarity 
among respective attributes (step 408). 
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[0104] Still referring to FIG. 4 and in more detail, a first group of items is presented for 
display to a group of selectors as described above in connection with step 216 of FIG. 2. 
For example, the items may be presented graphically, that is, a graphic representation 
such as a drawing or a photograph of the item is displayed to one or more selectors. In 
other embodiments, display of the items refers to the provision of a data file such as a 
computer-aided design (CAD) file or computer-aided manufacturing (CAM) file 
representing one or more items. In still other embodiments, items may be presented 
aurally. The items may be presented by the server computing nodes 30, 32, 34 or the 
client computing nodes 10, 20. Selection of items to be presented for display may be 
performed by the client nodes 10, 20, the server nodes 30, 32, 34, or some combination of 
client nodes and server nodes. 

[0105] Data indicative of item preferences is captured (step 404) as described above in 
connection with step 217 of FIG. 2. Item preferences may be captured at each client node 
10, 20 in response to the display of items in step 202. There exist many ways in which a 
selector may express preference across k entities of the population. The selector may rank 
the entities according to preference, for example, where the favorite entity (or entities, in 
case of a tie) receives a score of k, the next favorite a score of k- 1, and so on. 
Alternatively, the selector may rate each entity on a scale of zero to one hundred, or 
merely indicate which entities are acceptable and which unacceptable. Regardless of the 
manner in which voter feedback is given, the feedback from all voters is subsequently 
appropriately scaled such that responses are directly comparable. 

[0106] Scaling selector responses removes inconsistencies resulting from the case where 
the selector responds by rating entities on some scale (say, [0, 100]). If one selector is 
highly enthusiastic about all of the k entities, while another is very unenthusiastic, then 
the scales of the two sets of responses will not be comparable. As a result, the scores 
given by the enthusiastic selector will have more influence over the trajectory of the 
evolutionary system. 



43 



[0107] The scaling problem is solved through the use of normalization. Let Uj be the 
"raw" response, or score, given by voter i to entity / Vector u i is normalized to create 
vector g. : 

[0108] g< =^L- 

where 

*}e[0,l] 

and 

J 

The score of entity j is given by: 

[0109] Clearly, this step is unnecessary if voters respond by ranking entities, since 
ranking cannot produce a scaling problem. Where scaling problems do not exist, 
normalization may nonetheless be performed, since it does no harm to the voting data. 
[0110] Assembling a second group of items responsive to the captured data (step 406) 
involves determining the "fitness" of the members of the population, selecting, based on 
fitness, a subset of the population for mating, selecting "mates" for them , and allowing 
the resulting parent pairs to "reproduce," as described above. 

[0111] As shown in FIG. 4 A, the fitness fj of an entity j is defined to be its score (sy given 
above) divided by its niching discount. The niching discount, described in detail below, 
is a quantity that reflects the degree to which an entity adds redundancy to the population. 
By making the discount a positively correlated function of redundancy, we create a 
pressure to maintain genetic (and, presumably, phenotypic) diversity. Diversity 
maintenance is essential to successfully achieve and maintain distinct species, which can 
be viewed as separate preference profiles and/or market segments. 
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[0112] The fitness vector,/, is normalized to obtain the probabilities with which each 
entity will be selected to parent an offspring. Such a scheme is referred to as fitness- 
proportionate selection. Typically, fitness-proportionate selection is implemented by a 
simple "roulette wheel" algorithm, where each entity has a slice of the roulette "pie" that 
is proportionate in size to its probability of being selected. The wheel is "spun" once 
each time we wish to select a parent. If the probability of an entity being selected is p, 
and we spin the wheel k times, then the expected number of times the entity will be 
selected is pk. 

[0113] This roulette wheel implementation yields a multinomial distribution. Thus, if the 
number of spins is large, the observed behavior will closely match the expected behavior. 
But if the number of spins is small, the observed behavior has a high probability of 
deviating from expected behavior. For any finite number k of spins, there exists a non- 
zero probability that an entity having probability 0 <p < 1.0 of being selected will be 
selected anywhere from zero to k times. 

[0114] Baker's Stochastic Universal Sampling is an alternative to the simple roulette 
wheel that is shown to have better statistical properties. The roulette wheel is divided as 
before, but rather than a single pointer that is spun k times, k equally-spaced pointers are 
spun only once. If an entity has probability p of being selected, then SUS guarantees that 
it will be selected no less than [pk\ times and no more than [pk] times. In this 
embodiment, the "slices" on the roulette wheel are arranged randomly. For example, if 
the slices were arranged such that the smallest ones (where p < 1 / k) were next to each 
other, then no two neighbors could be simultaneously selected, as one of them would 
necessarily fall between a pair of the k equally-spaced pointers. (Other selection methods 
are described in the literature.) 

[0115] For embodiments using recombinative variational operators (i.e., crossover), the 
creation of & offspring requires k pairs of parents. Rather than use fitness information to 
select the mates, mating preferences are used. Each entity that evolves has a genome 
composed of two distinct parts. One part of the genome determines the merit traits of an 
entity - the characteristics that are evaluated by human voters and ultimately lead to the 
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entity's fitness. The other part of the genome determines the reproductive traits of an 
entity - the characteristics that express the entity's mating preferences. Reproductive 
traits do not affect an entity's fitness. 

[0116] The precise structure of an entity's merit traits - the types and ranges of allele 
values - is domain dependent. In contrast, reproductive traits are defined to be real 
numbers and are not limited to fall into a particular range. All entities have the same 
number of reproductive traits — q real-valued genes. We interpret an entity's q 
reproductive traits as a point in ^-dimensional Euclidean space. An entity prefers to mate 
with other entities that are closer to it in this ^-dimensional "reproduction" space than 
those that are farther. 

[0117] Figure 4B details how an entity picks a mate. A symmetric matrix R is computed 
where entry R, y is the Euclidean distance between entities i and j\ Our next step in 
computing mating preference is to derive matrix R' from R, as defined below. The larger 
the value of entry R' /V , the more entity i is willing to mate with entity j. An entity may 
not mate with itself, so the diagonal is composed of zeros. Specifically, the willingness 
for entity i to mate with entity j is: 



[0118] Thus, willingness to mate drops exponentially with Euclidean distance. This drop 
may be scaled using a coefficient,^. If fi = 0, then the entities have no mating preferences 
and will mate randomly (though self-mating will still not occur.) If entity i has been 
selected because of its fitness, we pick a mate for it by normalizing row i of matrix R' to 
obtain probabilities of mate selection. These probabilities are used to construct a roulette 
wheel, which is spun once to select a desired mate. This process is repeated for every 
entity seeking a mate. 




where 



r = 




:{R) 
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[0119] In this implementation, when two parent entities mate, a single offspring is 
produced. The recombination operators applied to merit traits are dependent upon the 
types and ranges of allele values allowed, and thus vary from domain to domain. The 
recombination operator applied to the parents' reproduction traits computes the arithmetic 
mean of their locations in reproduction space - the offspring is located midway between 
its parents in reproduction space. In one embodiment, after the offspring's location is 
computed, as small amount of Gaussian noise is added. In one particular embodiment, 
the added noise has a mean of zero and standard deviation of 2.0. 
[0120] As discussed above, an entity's fitness is defined to be its score divided by a 
discount factor that correlates to the amount of redundancy the entity brings to the 
population. While genotypic similarity is generally easy to measure, it is not necessarily 
an accurate predictor of phenotypic similarity, which is the space in which diversity is 
sought. Further, phenotypic similarity can be very difficult or impossible to determine, 
depending upon the nature of the problem domain and genotype-phenotype mapping. A 
species may be defined as a group of entities that is reproductively isolated from other 
groups of entities; entities within a species can reproduce with each other. The speciation 
process is driven by use feedback. If the collection of human aesthetic opinions clusters 
into two incompatible groups of designs, such that no entity belonging to the first group 
can produce a viable (high fitness) offspring by mating with an entity belonging to the 
second group, then two species will form. Niching facilitates the speciation process and 
allows species to more stably persist. Therefore, we can compute the redundancy an 
entity brings to the population by measuring its proximity to other entities in reproduction 
space. If one species begins to overpopulate the population, its members will begin to 
receive larger discounts than entities that belong to other (smaller sized) species. (An 
alternative embodiment, described later, uses genotypic similarity as the basis for 
computing the fitness discount.) 

[0121] Figure 4C shows how the niching discount is computed. As with the mate 
selection procedure, we begin with a matrix R where entry R 7J is the Euclidean distance 
between entities / and j in reproduction space. From this matrix we derive matrix R' 
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where entry R',j signifies the amount of discount generated with respect to entity j to be 
applied to entity I Given our similarity metric, we use a triangular method of computing 
similarity-based discounts: 

( -1 R. \ 

R & =max ^ — +1,0 

\ s threshold niax(i?) J 

5 where s threshold € [0,l] is a parameter that determines the minimal amount of similarity 
(or, maximal amount of ^similarity) that will generate some amount of discount. Larger 
values of s threshold decreases the minimal amount of similarity between entities i and j 
needed to generate a non-zero discount. 

[0122] In another embodiment of the invention, participants are recruited or invited to 
1 0 participate in the design or market research exercise using any number of methods. These 
could include, but would not be limited to, postal or electronic mail invitations, telephone 
calls, print or electronic advertisements, or word of mouth. These participants would be 
selected based on any number of factors or none at all, such factors including, but not 
limited to, belonging to particular user groups, fan clubs, demographic groups, 
1 5 organizations, etc. A selected subgroup would be directed to a location, which could be a 
physical location where one or more computer terminals would be set up for the 
participants to interact with, or to a Uniform Resource Locator address (URL) over the 
internet. Each participant would either be pre-registered or would be asked to sign up to 
O participate in the exercise, through a dialogue page 500 similar to the one shown in FIG. 

20 5. At that point, additional information may be collected about the participant, such as 
demographic or preference information, which may be used subsequently, either during 
the exercise to bias the choices presented to that particular participant, or in analyzing the 
data obtained during the exercise, and in presenting those results at the end of the 
exercise. In the embodiment shown in FIG. 5, a user's e-mail address, desired password, 
25 and zip code are entered in text entry boxes 502, 504, 506. Demographic information 
such as gender, age, country of origin and income range are entered using pull-down 
menus 510, 512, 514, 516. Other information is entered using check boxes 520, 522, 
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524. Other graphical user interface techniques may be used, such as radio buttons and 
sliders. 

[0123] After the preliminaries described in the previous paragraph, the participant is led 
to the exercise. In some cases, there may be more than one exercise in which the person 
has been invited to participate; in those cases, the participant is led to the different 
exercises, either in a controlled or prespecified fashion, or through a dialogue screen that 
allows the participant to select the exercise he or she wishes to work on. FIG. 6 shows 
such a dialogue. In the embodiment shown in FIG. 6, three design exercises are presented 
to the user: a polo shirt design exercise; a tee shirt design exercise, and a "demo" 
exercise. 

[0124] FIG. 7 shows a typical screen 700 that would be seen by a participant once he or 
she reaches the exercise proper. Such a screen presents the participant with a number of 
alternative choices for the design (or decision) object 702, 704, 706, 708, 710, 712. In 
this figure, the design object is a polo shirt, and the number of alternatives presented in 
this particular screen is six. Next to each design alternative, a "thumbs up" 720 and a 
"thumbs down" 722 button are provided as means for the participant (also referred to as 
"voter") to express their opinion about the design alternative in question. FIG. 7A shows 
the same screen after the participant has given the design alternative 704 a positive vote, 
and design alternative 708 a negative vote. In some embodiments, green and red borders 
may be used as a visual feedback mechanism to remind the participant of their assessment 
for the corresponding alternatives. In this figure, the remaining four design alternatives 
702, 706, 710, 712 have received neither a positive nor a negative assessment, meaning 
that the participant is neutral or ambivalent towards them, neither liking nor disliking 
them. Once the participant has input his or her assessments, votes are submitted by 
clicking on the "Vote" button 730. This results in a new set of design alternatives being 
presented for assessment to the participant, triggering a new iteration in the process 
described above. FIG. 7B shows a screen containing one such set of derived alternative 
designs. In the particular implementation described in this section, the assessment or 
voting information provided by the participant at each iteration is used in a number of 
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ways, which are described below. Before that however, the next paragraph describes the 
particular product featurization used in this example. 

[0125] The design object used in this exercise represents one particular product 
featurization. A polo shirt could be featurized in an infinite number of ways. In this 
particular example, a simplified featurization is used, consisting of the following design 
elements: the color of the body of the shirt, the style and color of the collar, and in the 
case of a particular collar type — the tab collar — the length of the tab. Each of these 
design elements or design attributes can take on any of a set of values. In the case of the 
body color, there could be, for example twelve discrete colors. In the case of the collars, 
there could be, for example four possible styles, each of which has a fixed color. Finally, 
in the case of the tab collar, the tab length could take on any real numeric value within a 
specified range (which is chosen so that the tab length cannot exceed the length of the 
body of the shirt.) A specific design candidate corresponds to a particular triplet of body 
color, collar style, and collar tab length (although the latter value may go unused if not 
needed.) 

Evolutionary Algorithm 

[0126] In this particular embodiment, an evolutionary algorithm is used to evolve the 
designs towards ones that are more fit, that is, to generate designs that are more in line 
with the preferences expressed by the voters. The genotype used to represent each design 
candidate consists of variables or genes representing the three design attributes described 
in the previous paragraph, along with additional variables that are used to control the way 
in which different design candidates are selected for mating (an operation, described in 
more detail below, in which attributes from two "parent" designs are combined to 
generate a new "offspring" design.) The first set of genes, Gl, G2, and G3, are referred 
to as the "merit" genes or variables (also, "feature" genes or "attribute" genes), since they 
are directly responsible for determining what a design candidate looks like, and therefore 
the degree of approval it gets. The second set of genes are hidden, in the sense that the 
way in which a design candidate looks to the participant is unaffected by the value of 
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these genes. These genes are referred to as "reproduction" genes or variables, for the role 
they play in mating and mate selection, as described later. In this particular 
implementation, these genes are real-valued and they represent the orthogonal dimensions 
of a Euclidean space, referred to as the "reproduction" space. Table 1 below represents 
5 the genotype for the polo shirt exercise under discussion. In this case two reproduction 
genes, Rl and R2, are used. 
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Table 1 : Schematic representation of genotype 



[0127] The evolutionary algorithm is a population-based search and optimization 
algorithm. In the present embodiment, the algorithm works with a population of designs 
of size N, where N typically ranges from 50 to several hundred. At the start of the 
exercise, this population is seeded at random, that is, by selecting allele values at random 
for each gene from the allowed range for that gene. Alternatively, the current 
embodiment allows for deterministic seeding, in order to reflect a particular desired 
starting population. 

Breeding 
Mate Selection 

[0128] When a participant submits a vote after assessing a first screen of candidates, the 
20 information is used to generate new designs and to populate the subsequent screen that is 
presented to him or her, based on the following procedure. Every design candidate in the 
first screen that received a thumbs-up vote is immediately selected for breeding; in other 
words, it selected to be a parent, call it PL Next, a suitable mate is selected for it from 
among the larger population of designs. That mate becomes parent P2. If less than half 
25 of the displayed candidates receive thumbs ups, the current implementation can be, and 
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usually is, set so that each of the selected candidates is bred twice. In this particular 
embodiment, mate selection is done stochastically based on the Euclidean distance in 
reproduction space between PI and all other members of the population at that point in 
time. More specifically, the probability that any of the N-l members of the population (or 
"entities") will be selected as a mate for PI is computed based on the following formula: 



where Pr(y | i) is the probability that entity j will be selected as a mate for entity i (with 
Pr(z |0 = °) an d where / (d & ) is a function of the Euclidean distance between entities j 
and i in the reproduction space, i.e.: 



[0129] The particular function used in this case decreases monotonically with distance; 
specifically: 



where /? is a real valued parameter that determines the strength of proximity bias in 
mating, and d max is the maximum distance in reproduction space between any two entities 
at that point in time: 



The value of J3 is in the range [0, oo), where a value of zero results in no mating bias and 
larger values give an increasingly more restrictive mating bias. 



Pr(; 1 0 = 
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[0130] Another version of function f(d iJ ) 9 also used in this embodiment, is given by: 



In this case, ^is a real valued parameter that determines the strength of distance bias in 
mating. The value of y is in the range (0, oo), where smaller numbers give an increasingly 
more restrictive mating bias. 

[0131] The distance-weighted probabilities thus computed are used to load a "roulette 
wheel", or are used in a Stochastic Universal Sampling scheme. Once a mate is selected 
for PI, that entity becomes the second parent, P2, for the offspring about to be created. 
This is done through a crossover operation, optionally followed by a mutation operation. 
[0132] In one alternative method for mate selection a genetic algorithm is used in which 
the bit string representation includes a set of functional genes (which correspond to our 
feature genes) and two other sets of genes that control mating (which correspond to our 
reproduction genes). One set of mating genes is called a mating template and the other set 
is called a tag. Both sets must have the same number of genes. The template and tag 
genes evolve along side the functional genes and are subject to crossover and mutation, A 
template gene can take on one of three values: 0, 1, or a wild-card symbol. A tag gene 
will be either a 0 or 1. Two individuals are allowed to mate if the template of one matches 
the tag of the other. If a 0 or 1 is specified for a particular template gene of one 
individual, then the same value must appear in the corresponding tag gene of the other 
individual. If the wild-card appears in a particular template gene, then any value of the 
corresponding tag gene will match. One mating scheme requires that the template of one 
individual match the tag of the other; an alternative scheme requires that each 
individual's template match the tag of the other for mating to occur. In either case, if no 
matches are found, partial matches may be allowed. 

[0133] Generally, the idea of R-space is to prevent inter-breeding between distinct 
clusters of designs. Nevertheless, occasional experiments with inter-breeding can lead to 
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important innovations. The dimensionality of R-space affects the neighborhood structure 
between clusters, and therefore the ease with which different clusters may attempt inter- 
breeding experiments. One obvious modification is to increase the dimensionality of the 
R-space from two to a higher number. Another possibility is to interbreed two designs 
that have received a thumbs-up from the same voter, perhaps within the same focus 
window. Such inter-breeding will create a small bridge in R-space between the two R- 
space regions where the parent designs are located. If the inter-breeding experiment is 
successful, then a new cluster will form. If the experiment is unsuccessful, then the 
offspring will become extinct. 

An Alternative to R-Space 

[0134] An alternative embodiment of the present invention may include an assortative 
mating mechanism that determines mate selection based upon genealogical distance, as 
opposed to the R-space distance scheme described above. Specifically, the likelihood of 
two individuals P t and Pj mating is related to the length of the shortest path that connects 
them in the "family tree." Individuals in the initial population are considered siblings, that 
is, we assume the existence of a "primary" parent that creates the initial population. The 
family tree is represented as a graph, where vertices correspond to individuals, and edges 
represent parent/child relations. Thus, an edge will exist between two individuals if and 
only if one of them is a parent of the other. Each individual records the identities of its 
parents, of which there are exactly two, so the graph is easily constructed. The only 
exception to this rule applies to individuals in the initial population, which all have an 
edge to a single parent vertex (the "primary" parent) that is inserted into the graph. The 
distance dy between two individuals P t and Pj (neither of which are the "primary" vertex) 
is the length of the shortest path between them on the graph. In the current 
implementation, the length of a path is measured by the number of distinct edges (or 
parent-child relationships) traversed to go from one individual to the other individual (as 
opposed to the Euclidian distance which is used in the R-space implementation described 
earlier.) 
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[0135] The probability of individual Pj being selected as a mate for P t is: 



(6) 



where: 



(7) 



5 and where is the length of the shortest path connecting vertices and d msK is the 
maximum over all d,j (also known as the diameter of the graph), and v m i n (a parameter < 
daaod is the minimal value that function / will return, to provide a non-zero minimum 
probability of mating for individual designs that are very far from each other. 
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[0136] Over time, the weaker branches of the family tree become extinct, leaving other 
branches that may possibly be distantly related. If so, then the different branches are 
reproductively isolated and therefore distinct species. The branches that go extinct are 
genetic combinations that are poor relative to the genetic combinations that survive. Thus, 
this method provides another approach to assortative mating. 

[0137] Crossover operations are represented schematically below: 
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[0138] In the implementation described here, the parent genomes are crossed on a gene- 
by-gene basis. In other words, the genes for the body style, Gl, from parent PI and 
20 parent P2 are "combined" or "crossed" by themselves, followed by the G2 genes 
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representing the collar style, then the G3 genes representing tab length. The reproduction 
space genes, Rl and R2 are also crossed, again individually. The particular cross-over 
operation used depends on the nature of the gene in question. For example, in the case of 
gene Gl, which represents a categorical variable, the allele value from one of the parents 
5 is selected at random. Similarly for gene G2. This is shown schematically below: 




Where a is a random variable picked from a uniform distribution: 

a ~ t/[0,l] 

[0139] Gene G3 represents an integer value, which makes it possible to use different 
crossover operators, as an alternative to the "random pick from one parent" scheme. One 
possibility is to compute interpolated and extrapolated values using the two values from 
the parents, and then to select one of these two possibilities at random. The process is 
described below. First, a Bernoulli trial (a "coin flip") is performed to decide whether to 
interpolate of extrapolate a value for the offspring gene, from the two values of parent 
genes. 

Interpolate between G3 P i and G3 P2 

Extrapolate between G3 P1 and G3 P2 
20 

[0140] Where y is either a deterministic real value between 0 and 1, or a randomly 
generated variable within that range, for example one from a uniform distribution: 
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ye (0,1) 



or 

y~U[Q,\] 

If the decision is interpolation, a formula such as the one below is used: 

G3 01 =Round(p-G3 Pl +{l-fi)-G3 P2 ) 

where p is a real value between 0 and 1, either selected deterministically or drawn at 
random, at the beginning of an exercise, or at every breeding. Alternatively, different 
deterministic values or different distributions (in the case of variables drawn randomly) 
could be used at different points in the exercise. Since G3 is an integer gene, the value 
obtained by interpolation is rounded to the nearest integer. 

[0141] If extrapolation is selected instead of interpolation, one of parent values is picked 
to determine the direction of such extrapolation; this is done at random. If Pi's is picked, 
then a formula like the following one can be used: 

G3 0l = Round(v ■ ((l + ju) ■ G3 n - fi • G3 P2 )) 

where v is a (possibly random) real valued parameter, typically less than 1 .0, chosen to 
scale down the size of the extrapolation step taken. An additional step not reflected in the 
formula above involves checking that the value thus computed does not exceed the 
allowable range for gene G3, and setting it equal to that limit if it does. 
[0142] If P2 is picked as the extrapolation direction, then the following can be used: 

G3 01 = Round(v.((l + ju) • G3 n -ft-G3 Pl )) 

[0143] The reproduction space genes, Rl and R2, being real-valued, can be treated 
similarly, except that the rounding operation is not needed. In the present 
implementation, a modified averaging operation is used, as follows: 
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Rl 0l =Y{Rl n +Rl P2 )+e 



where s is a Gaussian noise: 

s~N(0,2) 

The calculation of R2 0 i proceeds similarly. 

[0144] Many other schemes are within the knowledge of those of ordinary skill in the art. 
Mutation 

[0145] In addition to the crossover operation, or concurrent with it, a mutation operation 
is applied, to introduce occasional random variation in the design candidates that are 
generated. In the current implementation, this is done on a gene-by-gene basis again. For 
each gene, a determination is made, either before of following the crossover operation, as 
to whether a mutation is going to be applied. This is based on Bernoulli trial with a 
relatively low probability of success, around 0.01 typically. In the case of categorical 
genes, the mutation involves selecting, at random, one of the allowable allele values, 
typically a value that is different from those of the two parents. In the case of integer and 
real-valued genes, a Gaussian noise is added to the gene value obtained after the 
crossover operation is complete. Again, a check is performed to make sure that the 
mutated value is within the allowable range; if it falls outside that range, it is set equal to 
the upper or lower limit, as appropriate. Another case, not used in this example (the polo 
shirt) is where a gene is encoded as a binary bit or string. An example would be a design 
feature such as a logo or rings around the sleeves, which are turned on or off, depending 
on whether that bit is enabled or not. In that case, a mutation would simply involve a bit 
flip. 

[0146] Mutation, as described so far, is only applied after a breeding event, and a 
breeding event is only triggered by a thumbs-up vote. A refinement to the implementation 
is triggered when no thumbs-up votes are generated, to prevent the evolutionary process 
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from stagnating. In that case, we generate some number of random individuals every time 
a voter submits a set of votes that contain no thumbs-up. The merit attributes for these 
random individuals are generated as described above for initializing the population. The 
R-space attributes for these random individuals are determined as described below, in the 
5 section that discusses re-insertion of voter "picks" . 

Replacement/Removal Policies 

[0147] Once one or more new design candidates (the offsprings) are created, they are 
introduced into the population. In order to do that, a corresponding number of current 
members of the population must be selected for replacement. Various strategies are 
1 0 employed for that purpose, ranging from purely random selection, to relatively intricate 
schemes based on fitness (or lack thereof) and redundancy. (Various ways used to 
measure redundancy and diversity are described later.) In the simple case, a population 
member is chosen at random: a random integer / uniformly distributed between 1 and N 
(the size of the population of design candidates) is generated, and the i' h member of the 
1 5 population is removed and replaced by the offspring. This is repeated as many times as 
the number of offsprings created by a mating event. Another option in the current 
implementation is to bias the removal by fitness, or rather, lack of fitness. In that case, a 
misfitness score is maintained for each member of the population, and that score is either 
used deterministically to remove the member(s) with the highest misfitness score(s), or 
20 stochastically by loading a "roulette wheel" with slices proportional to these misfitness 
scores. A very simple algorithm for computing misfitness scores, one which only relies 
on "thumbs-up" votes, is the following. First, any members of the population of N 
designs that have not been assessed yet, and that therefore have received no votes, are set 
aside and are not candidates for removal. This is to avoid the premature loss of design 
25 candidates, unless absolutely needed (at which point we pick uniformly at random) . 
Next, for each of the remaining members of the population, the rate of "thumbs-ups" is 
computed as the ratio of "thumbs-up" votes received by that entity divided by the total 
number of votes received by it (i.e., the sum of "thumbs-up", "thumbs-down", and 
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"neutral" votes.) Next, the average rate of "thumbs-up" for all members of the population 
is computed, and the population of designs is divided into two groups, those that have a 
"thumbs-up" rate greater then average, and those that have a rate equal to or lower than 
the average rate. Members of the latter group are selected at random for removal, as 
5 needed. 

[0148] A more discriminating removal scheme that uses all three types of votes — 
thumbs-up, neutral, and thumbs-down— is sometimes used in the current implementation. 
In that case, the misfitness m t for the i~ member of the population is computed as a 
weighted sum of that member's thumbs-up, neutral, and thumbs-down rates, as follows: 
10 m t = w dmm ■ R?™ + w" e '" ml ■ R" e "' raI + w" P ■ R? 

where the w' ype terms are the weights for the particular type of vote, and R', ype terms are 
the vote rates of the given type for the z th member, with w dow " > 0 , w" p < 0 , and w nc "" ra ' 
generally positive. For example: 

m, =3-Rf' m +l-Rr'"" -^-K P 

1 5 [0149] Again, design candidates that have not been seen by any of the participants are set 
aside, to prevent their premature elimination (unless absolutely necessary, for example in 
some cases early on in an exercise.) 

[0150] Another variation on the removal policy modifies the contribution to the 
misfitness rating of similar votes, based on whether they were all cast by the same 

20 participant or by different participants. The idea behind this version is to penalize a 
design candidate more if it disliked by a number of different participants, that is, if 
different participants gave it thumbs-down for example, as compared to when it gets the 
same number of thumbs-down from only one participant. In this version, the individual 
votes for each entity are tracked, and the misfitness is computed based on declining 

25 weighting function or schedule for each participant's votes, as in the equation that 
follows: 
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where /w, is the misfitness score of entity /, V i is the total number of votes received up to 
that point by entity /, V t J e is the number of votes of the given type cast by voter; for 

element /, ^ represents the summation over all voters j, and y is a real parameter that 

j 

determines the steepness of an exponentially decreasing weighting function that reduces 
the impact of additional votes cast by the same participant. 

[0151] Another class of removal schemes take into account how redundant a particular 
member of the population is, in addition to its misfitness. The idea here is the following: 
given two entities that are equally unfit, it is preferable to remove the one that is 
genotypically similar to many other members of the population, in order to minimize the 
loss of genotypic diversity in the population. The redundancy computation can be based 
either on the reproduction genes, the feature genes, or both. These computations are 
described in the next section. Given a redundancy value R(P0 for a member of the 
population Pi, its adjusted misfitness value m'i is computed, as: 

m]=R(P,)-m, 

[0152] The next section describes various ways of measuring redundancy, or its opposite, 
diversity. 

Diversity Measurement 

[0153] Diversity measurement techniques are applied to both feature genes as well as 
reproduction genes. We use measures of diversity to dynamically control various 
parameters of the evolutionary algorithm, such as the mutation rate (mutation 
probability), as well as various strategies used in the system, such as the removal (or 
replacement) strategy and the strategies used to populate a participant's voting window 
(which are described later.) 



m. = — 
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Redundancy 

[0154] Diversity in the evolving population of N designs is measured using a metric of 
genotypic (or phenotypic) similarity between pairs of evolving designs ("individuals"). A 
pair-wise similarity metric S(Pi , Pj) is defined, which returns a value between 0 and 1, 
where 1 signifies that Pi and Pj are genotypically (or, alternatively, phenotypically) 
identical. We then use this metric to compute the redundancy of each individual in the 
evolving population with respect to the population as a whole, as follows: 



[0155] An individual with a high redundancy value is relatively common, in the sense 
that there exist many other individuals in the population that are similar to it. These 
redundancy values are used to help maintain diversity by biasing removal policies 
towards more redundant individuals, as explained in more detail below. Redundancy 
values are also used to provide a graphical visualization of genetic (or phenotypic) 
diversity. 

[0156] Two similarity functions are used in the current implementation. One is based on 
the feature genes, the other on the reproduction genes. In the case of the polo shirt, the 
first one uses the first three genes of the genotype. (The first two are categorical genes 
and the third an integer-valued gene.) We define our function S as follows: 



K k 



where P t k denotes the k" 1 gene of an individual i in the population. 



[0157] In the case of the categorical genes, Gl and G2, S' is given by: 
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[0158] In the case of gene 3, which is an integer gene, S' is computed as follows: 



LP 3 — P 3 ' 
_2 

MaxA 3 



where MaxA 3 is the range of gene G3, that is, the difference between the maximum and 
minimum values it is allowed to take. 

[0159] In the case of real-valued genes such as those used for the reproduction variables, 
redundancy or density is computed using the Euclidean distance dy (described earlier) in 
R-space between the different population members, as follows. The redundancy or 
density of the 1 th population member is given by: 

j 

where dy is the distance in R-Space between individuals / and j 9 and 



/ (x) = max 



1 ,0 

threshold • d t 



max J 



where threshold is a constant in the interval (0, 1] and 



rf m =max(^) 

V 



Entropy 

[0160] Population diversity is also measured by computing the Shannon entropy of the 
genotypic (or phenotypic) values in the population. A high entropy value suggests a high 
level of diversity. Entropy-based diversity measurement does not require a metric of 
similarity. We calculate the entropy of each gene independently and also combine the 
results using weighted averaging. To compute the entropy of a gene, we first count the 
frequency with which each possible allele value for that gene appears in the population. 
These frequencies are then plugged into the standard Shannon entropy equation: 
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where H(G k ) is the entropy of gene G fc J] is the sum over all the different values or 

i 

alleles that G* can take, Mj is the number of occurrences in the population of the z th allele 
for that gene, and N is the population size. This can be applied directly to genes Gl and 
G2. For genes that are similar to Gl and G2, but that span a range of many possible 
discrete (but ordered) values, we apply a coarse quantization to obtain a smaller set of 
discrete values. For genes such as gene G3 above, which span a continuous space, we 
convert the continuum into a set of symbols by quantizing the continuum to obtain a set 
of discrete bins and counting the M t occurrences of values that fall in each of these bins. 
[0161] In another possible embodiment, we may compute entropy based upon higher- 
order effects that occur between genes. To do this, we calculate entropy based upon the 
frequency with which each possible w-tuple of allele values appears in the population 
across the n selected genes. 

[0162] Entropy, being a population-wide measure is not used when a particular member 
of the population is sought, as in replacement or when populating a voting window. 
Rather, it is used to track the evolution process, and to adjust global parameters such as 
the mutation probability. 

Clustering 

[0163] In this section, we describe the subject of clustering, which relies on similarity 
measurements, and which is used at different times in the embodiment described here, as 
discussed later. If the function S(P f9 Pj) , described above, indicates the similarity 
between individuals P, and P h then we can define a new function 

D(P i ,P j ) = l-S(P i ,P J ) 
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to indicate the dissimilarity between these two individuals. With the function D, we can 
compute a dissimilarity matrix M 9 where each entry My is the dissimilarity between 
individuals P t and Pj. This matrix is symmetric and has zeros on the diagonal 
[0164] With the matrix M 9 we can apply any number of known clustering techniques to 
group the individuals either according to genotypic similarity or proximity in R-space, 
such as the K-medoid clustering algorithm. The K-medoid algorithm must be told the 
number of clusters to find. If the number of clusters that would best fit the data is not 
known, then the silhouette value of a clustering, can be used to decide how many clusters 
should be sought. 

[0165] We may also cluster the human users based upon their voting behaviors. In this 
case, we measure the correlation in the voting records of any pair of users V% and Vj and 
derive an entry My in matrix M, as follows: 

1 + correlation^ 9 V J ) 



Strategies for Populating the Voting Window 

[0166] The voting window, also referred to as the focus window, is the window presented 
to each voter for the purpose of displaying a set of design candidates and collecting that 
voter's assessment of them. The various policies used to populate the focus window at 
each voting iteration are described in this section. Generally speaking, these policies seek 
to achieve a number of sometimes conflicting goals: a) giving the participant an 
opportunity to explore as much of the design space as possible, and b) giving the 
participant a sense that the system is responsive to his or her votes. 

Voting window mixture policy 

[0167] The voting or focus window mixture policy examines the votes that are submitted 
from a first focus window and determines the number of slots in the next focus window 
(for the participant whose votes the system is currently processing) that will be filled 
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with: a) offspring of design candidates shown in said first focus window, and b) samples 
of design candidates from the general population of design candidates. 
[0168] In the present implementation, all individuals in the focus window that receive a 
thumbs-up vote will parent at least one, but no more than two, offspring. If the number of 

5 thumbs-up votes is less than the number of focus window slots, then the individuals that 
have received a thumbs-up vote will be used to produce a second offspring until each has 
produced a second offspring, or until the slots of the new focus window are filled, 
whichever comes first. For example, if the focus window has six slots, and two 
individuals are given a thumbs-up, then both will parent two offspring, which will fill 

10 four of the six slots of the new focus window. If, instead, four individuals are given a 
thumbs-up, then the first two individuals will each parent two offspring, while the last 
two will each parent one, thus entirely filling the six slots of the focus window. 
[0169] If, once all the thumbs-up votes are acted upon, any slots remain empty, then they 
are filled by sampling the general population of individuals, as described in the next 

15 section. 

[0170] The policy described above is modified slightly when only one offspring is 
allowed for each candidate that receives a thumbs-up (see breeding section above.) 
[0171] An alternative mixture policy used in the current implementation introduces the 
notion of elitism — well known in the Evolutionary Computation literature — into the focus 

20 window, such that some or all of the individuals that receive a thumbs-up are retained in 
the next focus window. Typically, elitism is used in generational versions of evolutionary 
algorithms in order to avoid the disappearance of highly fit members of the population 
across subsequent generations. In this case, we use a similar notion in the focus window 
or voter window. The motivation behind that policy is to provide a sense of continuity 

25 for the participant who might be uncomfortable with the disappearance from the focus 
window of previously preferred design candidates. When thumbs-up voting is used, as 
described in this example, if more entities received thumbs-up than there are elite slots in 
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the next window, random picks are made among those entities that received thumbs-up, 
until the elite slots are filled. 

[0172] Yet another alternative policy in the current embodiment fixes the minimum and 
maximum number of focus window slots that will be allocated for: a) elites (individuals 

5 that have received a thumbs up and that are carried over), b) offspring of those 

individuals that have received a thumbs up, and c) samples of the general population. If 
the number of thumbs-up votes exceeds the number of slots allocated for offspring, then a 
sampling method is invoked such that only some of the recipients of thumbs-up votes are 
able to parent an offspring. Alternatively, we can limit the number of thumbs-up votes 

10 that a user is allowed to make per focus window. Yet another alternative is to create 
offspring for every individual receiving a thumbs up, but not include all the offspring in 
the subsequent focus window (those not appearing in the focus window will still be in the 
general population). 

Focus window sampling 
15 [0173] For focus window slots that are available for samples from the population at large, 

a policy is needed to decide how these candidates are chosen. In the current 

implementation, the simplest policy used is one where we sample randomly, uniformly 

across the population of individuals. This sampling takes place after all offspring 

(parented by the individuals that received a thumbs up) have been inserted into the 

20 population. The sampling procedure does not attempt to prevent the same individual 
from appearing twice in the focus window, nor does it attempt to prevent two distinct 
individuals that are genotypically identical from appearing together in the focus window. 
[0174] An alternative approach is to bias the sampling away from regions of high 
redundancy (redundancy being computed as described in a previous section.) The 

25 advantage of these policies is to allow for greater exploration of the design space by the 
participants, by affording greater diversity in their focus windows. One such policy, used 
in this embodiment, utilizes R-space redundancy to discount how likely a particular 
population member is to be selected. More specifically, roulette wheel selection is used, 
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with the slice given to each of the N members of the population being inversely 
proportional to the redundancy of that member: 



[0175] Another policy uses feature space redundancy (calculated on the basis of the 
feature genes) to bias the sampling, again using the same formula as above. 
[0176] An alternative policy embodied in the present system performs a cluster analysis 
(described above) of the individuals in the population, either with respect to their 
positions in R-space, their genotypic characteristics, or both. Once the clusters are 
determined, the random sampling is conducted such that each cluster is equally likely to 
provide an individual for the open focus window slots, regardless of the number of 
individuals in each cluster. The advantage of this scheme is to allow the participant to 
sample equally from the different species or preference clusters (or aesthetic clusters) that 
are emerging during the exercise (speciation is discussed later.) This is in contrast to 
uniform sampling where, in effect, we sample from every cluster in proportion to the 
cluster size. A related approach is one where we select the representative design 
candidate for each cluster (the centroid or medoid of that cluster.) 
[0177] In yet another policy, we bias the sampling in favor of individuals that have been 
infrequently viewed by that participant. In this case, the probability of a member of the 
population being selected is inversely related to the number of times it has appeared in his 
or her focus window. The probabilities used to load the roulette wheel are given by: 



where m, y is the number of times that design candidate P t has appeared in the focus 
window of participant j, and f(x) is a monotonic function. For example: 





f(m if ) = ml 
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[0178] In a related policy, we bias the selection in favor of individuals with feature 
properties that have been infrequently viewed (based on feature similarity), or in favor of 
individuals in regions of R-space that have been infrequently viewed in the focus 
window. Here too, the probabilities used to load the roulette wheel for selection are given 
by: 



Pr(/» \W') = 



R(P,\W')/ rR(P,\W) 



where R(P, \ W) , the redundancy of population member P t with respect to the t th focus 
window W 1 (W 1 being the current window, W 2 the previous window, etc.) of the given 
participant is given by: 

q 

where £ is the summation over all q members or design candidates in the focus 

window, and S{P i ,W i q ) is the similarity between entity P t and the q th member of focus 
window W. Finally, S, the similarity function, is computed using any of the methods 
given in the previous section on redundancy and similarity, as appropriate. 
[0179] A variation on this policy is one where we track not only the last focus window, 
but the last few or n focus windows and where we either give all of them equal weight or 
give the content of the more recent focus windows greater importance in the redundancy 
calculations. One particular version of this looks at the last n focus windows (w=3, e.g.), 
and weights them differentially. The slices or shares used in the roulette wheel in this 
case are given by: 

"(if 1 ^ 

Q(P.) = 5>, •{ R(Pj]W > ) / ^RtflW); 

with the weighting factors co, decreasing with 
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as an example. 

[0180] In yet another sampling policy, used with in this implementation, we bias the 
sample away from individuals that are redundant (either based on feature space similarity 
or on reproduction space similarity, or both) with respect to individuals that have been 
given a thumbs-down vote by the participant whose focus window is being populated. 
This is intended to minimize the chances of subjecting that participant to design 
candidates that he or she already voted down. This is done in a manner similar to the 
ones described in the previous policy, except in this case, the redundancy used is not 

| W) but R(P i | W domJ ) , which is computed only with respect to those focus 
window members that received a negative vote from the participant in question. A 
related policy is one where we bias the sample towards individuals that are redundant 
(either in feature space, reproduction space, or both) with respect to individuals that have 
been given a thumbs-up vote (alternatively, a neutral vote) by the user whose focus 
window is being populated. In that case, R(P f \ W upJ ) is used, the probabilities or shares 
used in the roulette wheel are directly proportional to redundancy, as opposed to inversely 
proportional; for example: 



i 

[0181] Yet another policy attempts to maximize the diversity in the focus window with 
respect to the genetic content of design candidates (either based on feature genes, 
reproduction genes, or both) with each subsequent sample being biased away from the 
properties of the individuals placed into the focus window up to that moment. The 
rationale is to increase diversity in the participant's focus window. 
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[0182] Any of the policies mentioned above, or variations thereof, can be employed to 
populate a participant's window when that participant returns after being away from an 
ongoing exercise for a while. Another policy used specifically for that purpose involves 
reloading a returning participant's window with the same candidates that were present in 
his or her last focus window when they last logged off This policy is often problematic 
however, as these candidates are likely to have been removed from the population, 
necessitating that they be recreated and re-inserted in the population. An alternative is to 
present the participant with as broad a sampling of the current design population as 
possible. This is done by sampling from cluster representatives as described earlier. This 
policy is also used in the case of a participant who joins the exercise after it has been 
ongoing for some time, and who is not identified with any particular preference segment. 
[0183] In one embodiment certain refinements are added to the voting window, which are 
intended to provide the participant with some or all of the following: a) a measure or 
indication of progress during the exercise; b) a sense of accomplishment as goal posts are 
reached during the exercise; c) more direct control over the evolution process; d) a sense 
of membership in a community of co-participants in the design process. FIG. 7C shows a 
voting window with two of these refinements on the right hand side. These include a 
progress bar 780 that covers a range from 0% to 100%, and that indicates the level of 
progress with a colored section. The other refinement shown in the same figure is the 
"pick panel" 788, which is the panel on the right hand side of the voting window, under 
the progress bar, labeled "Marker Designs". In the figure, the picks panel shows three 
thumbnails arranged vertically, one of them with a selection in it, and the other two still 
blank. The picks panel displays particular design candidates at certain points during the 
exercise, based on one of the strategies described below. In the case shown, an "X" mark 
under the selected pick allows the participant to remove said pick and to restart that part 
of the exercise that resulted in that particular pick. 

[0184] Four classes of strategies may be used in this embodiment. The first class of 
strategies relies on a fixed number of votes submitted by the participant; a second class 
depends on the degree of similarity among the candidates that are showing up in the 
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participant's last few voting windows, and therefore may involve a variable number of 
voting submittals by the participant in question. A third class allows the participant to 
directly select one of the design candidates in the voting windows a pick, by using a 
special button next to the thumbs-up and thumbs-down button (not shown in this figure.) 
Finally, a fourth class of strategies are intended to use the pick panel to show the 
participant how other participants are voting. 

Strategy I: Analyze a preset number of votes and pick 

[0185] In this strategy, the system is set to allow each participant to view and assess a 
preset number n of voting windows, with typical values of n ranging between 6 and 40. 
In this case, the progress bar increases in proportion to the ratio of voting windows 
viewed by the participant up to that point, to the preset number n. After the n vote 
submittals, a pick is automatically made on behalf of the participant based on his voting 
patterns, as described below, and the progress bar is reset to zero, a new voting window 
populated at random from the population of designs at large, and a new set of n vote 
submittals is started. The voting window shown in FIG. 7C corresponds to a case where 
the participant is asked to go through three sets of n vote submittals, resulting in three 
picks. 

[0186] After the preset number n of voting windows, an analysis is performed on that 
participant's votes on these n windows (all the votes may be examined or only the last 
80% of the n submittals may be examined to remove any "training" or accommodation 
effects.) In one scheme, the analysis involves counting the thumbs-up votes received by 
each allele, and using the counts to generate the most "selected" combination of attributes 
values. At that point, a design candidate is assembled using these most selected attribute 
values, and it becomes the pick. This approach works well when there are few or no 
dependencies between genes. A more refined analysis that works well even if there are 
dependencies involves the following steps: After the n vote submittals have been 
received, all candidates in these voting windows that have received a positive vote 
(thumbs-up) are collected. Then, a first positive-vote-candidate is selected, and, starting 
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with the first gene of that candidate, a count of how many of the other positive-vote- 
candidates share the same allele for that gene is performed. This is repeated for all the 
genes of the selected candidates, and these k counts (k being the number of genes) are 
added up; this count is the "representativeness" score for that candidate. This process is 
repeated for every one of the positive-vote-candidates, and these are ranked on the basis 
of their score. Of those, the top-ranking positive-vote-candidate is selected as a pick. 
[0187] In one variation, the participant is given a chance to reject the chosen pick, in 
which case the next highest scoring one is selected as a pick, and so on. If several (for 
example, three) are rejected, that set of n iterations is restarted. In another variation, the 
participant is presented with a panel showing the three highest scoring pick candidates, 
and he is given the opportunity of choosing the one he deems closest to what he had been 
voting for. 

Strategy II: Focus window convergence pick 

[0188] In the second class of progress indication strategies, the progress bar does not 
increase monotonically, but it might regress depending on the behavior of the participant. 
If a voter votes consistently, then it is more likely that his successive voting windows will 
be populated with increasingly similar design candidates; in that case, a progress bar tied 
to the similarity of the contents of these successive voting windows will increase. In this 
case, the number of vote submittals prior to a pick selection is variable. As some fraction 
(say, %) of the design candidates in the voting window became identical or very similar, 
the most duplicated candidate is chosen as a pick. Having made the pick, and if the pick 
is not rejected by the participant, a new focus window is populated (e.g., at random), and 
the participant starts the next phase of the process that will yield the next pick. If the pick 
is rejected, alternatives similar to the ones presented above under Strategy I are followed. 

Strategy III: Direct selection 

[0189] In this case, after a certain number of voting submittals have been made by the 
participant, an additional button is enabled next to each of the design candidates in the 
focus window. That button is a direct pick button, which allows the participant to select 
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the corresponding candidate to become a pick. Alternatively, when direct picks are 
enabled, the participant is allowed to drag the desired candidate from its location in the 
voting window onto the picks panel area, which will place a copy of it there. Once the 
participant makes a direct pick, the direct pick buttons are again disabled for a preset 
number of voting iterations. The pick panel has a fixed number of slots to hold the picks, 
and when a new pick is inserted by clicking its direct pick button, it gets placed at the top 
of the Pick Panel, while everything else moves down one slot, the design occupying the 
bottom slot being discarded. If the pick is made by dragging it onto the pick panel, then 
the picked design either replaces the item in the slot onto which it is dragged and 
dropped, or the items at that slot and below are shifted down one slot (item in bottom slot 
again discarded). No matter how the pick panel is managed, the history of all picks is 
recorded for subsequent analysis. 

[0190] A variation on this scheme also allows the participant to reinsert one or more of 
the picks in the pick panel back into the population of design candidates (and therefore in 
his focus of voting window as well) later in the exercise, if the participant gets the 
impression that that design candidate may have been lost. In that case, the R-space values 
of that candidate are updated to reflect the changes that may have taken place in R-space 
in the interim. One cannot rely on that candidate's previous R-space coordinates to be 
compatible with the current configuration of R-space, since R-space is constantly in flux. 
A new R-space location can be chosen in one of the following ways: 

1) A region of R-space that contains the designs that are most similar to the design 
we wish to re-insert is located, based on feature gene similarity; the re-inserted 
design is given new R-space coordinates that place it in that neighborhood. If no 
designs in the population are sufficiently similar to the design to be reinserted 
then: 

a) Pick R-space coordinates at random, but within the bounding box of the 
current population (optionally expanded by some amount); 

b) Place the re-inserted individual in the least dense region of R-space. 
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c) Place the re-inserted individual at the periphery of the populated regions of 
R-space at a distance from the periphery determined by the average 
distance between cluster representatives. 

Strategy IV: Social network effect schemes 

[0191] This is a family of strategies that involve showing the participant, in a pick 
window, not only the pick candidates estimated based on his voting patterns, but also the 
picks (candidates or actual) for other voters. In this case, the most popular design 
candidate across voters is estimated using the same techniques described under Strategy I 
above, except that the positive-vote-candidates are collected from all participants, not 
only from the participant whose voting window we are discussing. 

Speciation and dynamic (or co-evolutionary) segmentation 
[0192] When the (3 parameter used to control mate selection (Eq. (3)) is set to a high 
enough value, such as 40.0, then the mechanisms and procedures outlined above will 
automatically allow different preference profiles to emerge and to coexist during the 
process. (In case Eq. (5) is used instead, then the y parameter needs to be small enough.) 
To the extent that the participants represent a population of consumers in a market, and to 
the extent that different subgroups in that market end up evolving preferences for distinct 
combinations of product attributes, then the system in effect performs a sort of dynamic 
segmentation of that market. The term "dynamic" is used here to indicate that the 
preference profiles and the corresponding preferred designs are co-evolved during the 
process. This is different from existing approaches to market segmentation, which either 
assume given preference profiles (for which appropriate design are developed), or given 
designs for which the appropriate customers are identified. This section is intended to 
explain how the current implementation affords that segmentation capability, and to 
present a simple example. 
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Assortative mating 

[0193] To the extent that crossover operations between certain individuals (design 
candidates) results in new candidates that are less preferred by the participants, we seek to 
prevent such mating from occurring. However, we do not know a priori which such 

5 matings will be deleterious. The R-space mechanisms that express individuals' mate 
choices can learn, over time, which mate pairs are compatible and which are not, based 
upon the assessment by participants of the outcomes of actual matings. Pairings of 
genetic material that are successful will gradually tend to occur more frequently and, 
thereby, crowd-out those pairings that are less successful The prohibition (or reduced 

1 0 likelihood) of certain mate pairs is known as assortative mating, and each set of 

individuals that are allowed to mate with each other, but not with members of another set, 
is known as a species. 

[0194] The evolution of species (speciation) is of direct importance to dynamic 
participant preference segmentation. When a design exercise begins, the R-space is 

1 5 homogenous: the R values of the population of design candidates are distributed 
uniformly in R-space. As evolution proceeds, information is gained (through the 
participants' feedback) about which pairings of genetic material are more successful than 
others. As a result of participants' assessments and the crossover operations on the 
reproduction genes, the distribution of the gene values in R-space becomes 

20 heterogeneous. In other words, the R-space begins to cluster. This heterogeneity is 
structured in a way that keeps certain individuals near each other and far from others. 
These clusters correspond to species, that is, sets of individuals that are reproductively 
isolated. As reproductive isolation emerges, each species, along with the participants who 
have evolved it through their voting, become specialized to a particular sub-region of the 

25 design space, and they are less subject to interference from other species. 

Multiple niches in an ecology 

[0195] When a market has multiple segments, there exists a set of distinct preference 
profiles for each of these segments. Each segment's preference represents an area in the 
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design search space. These areas can be thought of as distinct ecological niches. The 
assortative mating dynamic allows multiple species to emerge and persist, where each 
species inhabits its own niche. The number of participants supporting each segment — a 
proxy for the size of that market segment — determines the carrying capacity of that niche, 

5 and thus the size of the corresponding species. In other words, as R-space clusters form, 
the size of a cluster (the number of design candidates that belong to that particular 
species) reflects the size of the market segment (assuming a balanced level of voting 
among participants, which can be controlled in the current implementation, either by 
limiting the number of voting screens presented to each participant, or by disregarding the 

10 votes submitted by a given participant that participant has reached his or her allotted 
number of votes.) Because the participants discover design possibilities as they interact 
with the system (and thereby form opinions), and the designs evolve in response to the 
participants, one can describe the interaction between designs and participants to be in 
some sense co-evolutionary. The preferences evoked by the evolving designs allow the 

15 system as a whole to converge on a set of designs that delineate multiple segments in the 
market. 

[0196] FIGs. 8 through 14 present an example of this dynamic segmentation process. In 
this example, two participants interacted with the system concurrently. The process starts 
with uniformly distributed reproduction genes and feature genes (see FIGs. 8 and 9, 

20 respectively) based on a random seeding of the population of candidates. After a number 
of voting cycles, two segments emerge, one corresponding to participant 1, and the other 
to participant 2. FIGs. 10 and 1 1 show the focus windows for the two participants at that 
point in the exercise. The content of each focus window is dominated by the design of 
choice for that participant, that is, the design choices shown to the first participant may 

25 feature different colors, patterns, and design styles (e.g. tab length) than the design choice 
presented to the second participant. The design choices shown to either particpant may 
be highly concentrated in R-space, that is, each design choice may be very similar to each 
other design choice shown to that particpant (e.g. similar colors, similar patterns, etc.). In 
other exercises the design choices presented to participants may be scattered in R-space, 
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that is, each design choice may have a different color or pattern from other design choices 
being presented to the particpant FIG. 12 shows the R-space plot at that point, with the 
design candidates corresponding to the two segments highlighted; in this embodiment, the 
two clusters are clearly distinguished. Finally, FIGs. 13 and 14 show the distribution of 

5 feature gene values for each participant at that point in the process. FIGs. 1 3 depicts the 
distribution of feature genes 1 though 3 for participant 1. Style "2" is the only surviving 
collar style, since it is preferred by both segments. Participant 1 prefers a purplish body 
style (body style "1") and a short tab length (value equal to 123). 
[0197] FIG. 14 depicts a distribution of feature genes for participant 2. Collar style "2" 

10 (tab collar) is the only surviving collar style. Participant 2 prefers a green body style 
(body style "6") and a long tab length (value equal to 1310). 

[0198] In one embodiment, the demographic information collected about each user may 
q be used to alter the evolutionary algorithm described above. For example, a system may 

^ accept input from a wide universe of users but only use input from a set of users having a 

W 1 5 particular demographic for the purposes of evolving the universe of design objects. This 

s ~ ; 

in embodiment allows the manufacturer to determine the preferences of a particular market 

iy segment without requiring the manufacturer to affirmatively direct a market research 

M effort at a particular demographic market. 

p [0199] In another embodiment, the system described above may be used to permit data to 

20 be gathered concerning competitive products. This is accomplished by including 
competitive products in the set of products designed to see if they "survive." In one 
particular embodiment, the evolutionary algorithm recognizes when a competitive 
product is genetically similar to a set of product designs selected by one or more selectors 
and inserts the competitive design into the next generation of product choices. 
25 [0200] In still another embodiment, the systems described above are used as a "virtual 
sales person," that guide a consumer in determining one or more products for purchase. 
In this embodiment, the product designs represent the universe of items for sale by a 
company and successive generations of products are selected from the universe based on 
received user input. 
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[0201] In still another embodiment, the evolutionary design system includes information 
from commercial actors that supply raw materials to the manufacturer. For example, a 
supplier may provide information concerning handles available for inclusion in a product. 
The information typically will include dimension information and style information, but 
may also include pricing information. In this embodiment, a selector may be provided 
with information regarding the cost of a potential design and that genetic factor may be 
considered in creating the next generation of products for review by the selector. 
[0202] In yet another embodiment, the evolutionary design techniques described above 
are enhanced by providing to selectors simulated endorsement data or other promotional 
schemes and strategies. In this embodiment, selectors that are perceived as opinion 
makers may have their voting preferences displayed to the voting public to determine if 
other selectors change their votes based on the knowledge of the opinion-makers voting 
preferences. 

[0203] Many alterations and modifications may be made by those having ordinary skill in 
the art without departing from the spirit and scope of the invention. Therefore, it must be 
expressly understood that the illustrated embodiment has been shown only for the 
purposes of example and should not be taken as limiting the invention, which is defined 
by the following claims. The following claims are thus to be read as not only literally 
including what is set forth by the claims but also to include all equivalent elements for 
performing substantially the same function in substantially the same way to obtain 
substantially the same result, even though not identical in other respects to what is shown 
and described in the above illustrations. 
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